Preface


The reader of Book 1 has already been introduced to simple functions, to 
their derivatives and integrals, to elementary sequences and series, to the 
foundations of probability and statistics, to vectors and mechanics. Our 
present task is to build upon this base. 

Whatever we might have achieved has been due to joint authorship, 
chapters 10, 13 and 14 having been written by F. J. Budden and chapter 12 
by D. Knighton, whilst L. K. Turner has been responsible for the remaining
chapters and for general editorship. As in Book 1, the treatment within
each chapter follows a natural progression and the individual teacher or 
reader can exercise very great freedom in deciding how the chapters 
themselves are to be combined. 

Finally, it is a pleasure to thank Mr Michael Spincer and all his colleagues 
at Longmans for their constant support and consideration throughout our 
work, Oxford Illustrators for the quality of their diagrams and the staff of 
William Clowes for the customary excellence of their printing. 


L.K.T. 
F.J.B.
D.K. 
Notation


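Vectors

$\mathbf{AB}, \mathbf{CD}, \dots$: displacement vectors
$\mathbf{a}, \mathbf{b}, \mathbf{c}, \dots$: free vectors, or position vectors, with moduli $a, b, c, \dots$
$\hat{\mathbf{a}}, \hat{\mathbf{b}}, \dots$: unit vectors in the directions $\mathbf{a}, \mathbf{b}, \dots$
$\mathbf{a} + \mathbf{b}$, $\mathbf{a} - \mathbf{b}$: vector sum and difference
$\mathbf{i}, \mathbf{j}, \mathbf{k}$: unit vectors, with right-hand set of mutually perpendicular axes
$\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}$: vector $\mathbf{r}$ with components $x, y, z$
$\dot{\mathbf{r}}$: velocity
$\ddot{\mathbf{r}}$: acceleration
velocity of $Q$ relative to $P$
acceleration of $Q$ relative to $P$
$\dot{\theta}$: angular velocity
$\ddot{\theta}$: angular acceleration
$\mathbf{r} = \dfrac{\alpha\mathbf{a} + \beta\mathbf{b}}{\alpha + \beta}$: $\mathbf{r}$ divides $\mathbf{a}, \mathbf{b}$ in the ratio $\beta : \alpha$
centroid of $\mathbf{r}_i$ $(i = 1, \dots, n)$
centre of mass of $m_i$ at $\mathbf{r}_i$ $(i = 1, \dots, n)$
$\mathbf{a}.\mathbf{b} = ab\cos\theta = a_1b_1 + a_2b_2 + a_3b_3$: scalar product of $\mathbf{a}, \mathbf{b}$
$\mathbf{a} \times \mathbf{b} = (ab\sin\theta)\hat{\mathbf{n}} = (a_2b_3 - a_3b_2)\mathbf{i} + (a_3b_1 - a_1b_3)\mathbf{j} + (a_1b_2 - a_2b_1)\mathbf{k}$: vector product of $\mathbf{a}, \mathbf{b}$


Mechanics

$\mathbf{F}$: force $\mathbf{F}$
$\mathbf{R} = \mathbf{P} + \mathbf{Q}$: resultant force
$g$: acceleration due to gravity ($g = 9.8\ \mathrm{m\,s^{-2}}$ on Earth)
$F, N, R$: frictional component ($F$) and normal component ($N$) of resultant reaction ($R$)
$\mu$: coefficient of friction ($F \le \mu N$)
$\lambda$: angle of friction ($= \tan^{-1}\mu$)
$\mathbf{I} = \mathbf{F}t$: impulse of constant force $\mathbf{F}$ in time $t$
$\mathbf{I} = \int_{t_1}^{t_2} \mathbf{F}\,dt$: impulse of force $\mathbf{F}$ in interval $(t_1, t_2)$
$m\mathbf{v}$: momentum of mass $m$ moving with velocity $\mathbf{v}$
$e$: coefficient of restitution (or resilience)
$W = \mathbf{F}.\mathbf{s}$: work done by constant force $\mathbf{F}$ in displacement $\mathbf{s}$
$W = \int \mathbf{F}.d\mathbf{s}$: work done by variable force $\mathbf{F}$
$\tfrac12 mv^2$: kinetic energy of mass $m$ moving with velocity $\mathbf{v}$
$P = \dfrac{dW}{dt} = \mathbf{F}.\mathbf{v}$: power of force $\mathbf{F}$ (= rate of doing work)
stiffness of a spring
elastic modulus of spring or string of natural length $l$
$\ddot{x} = -\omega^2 x$, $\dot{x}^2 = \omega^2(a^2 - x^2)$, $x = a\sin(\omega t + \varepsilon)$: standard equations of simple harmonic motion with amplitude $a$ and period $2\pi/\omega$
$\mathbf{r} \times \mathbf{F}$: moment about $O$ of a force $\mathbf{F}$ acting at point $\mathbf{r}$
$I = \sum mr^2$: moment of inertia
$G$ ($= 6.67 \times 10^{-11}$ in SI): constant of gravitation


Logarithmic, exponential, and hyperbolic functions

$\ln x = \log_e x = \int_1^x \dfrac{dt}{t}$ $(x > 0)$: natural (or Naperian) logarithmic function
$e$ ($\approx 2.718$): base of natural logarithms ($\ln e = 1$)
$e^x$ (or $\exp x$): exponential function
$\sinh x = \tfrac12(e^x - e^{-x})$: hyperbolic sine
$\cosh x = \tfrac12(e^x + e^{-x})$: hyperbolic cosine
$\sinh^{-1} x = \ln[x + \sqrt{x^2 + 1}]$, $\cosh^{-1} x = \ln[x + \sqrt{x^2 - 1}]$: inverse hyperbolic functions


Coefficients in binomial series

$\dbinom{n}{r} = \dfrac{n(n-1)(n-2)\dots(n-r+1)}{r!}$ $(n \in \mathbb{Q},\ r \in \mathbb{Z}^+)$


Complex numbers

$z = x + yi = r(\cos\theta + i\sin\theta) = r\,\mathrm{cis}\,\theta$: complex number $z$
$\mathcal{R}(z) = x$: real part of $z$
$\mathcal{I}(z) = y$: imaginary part of $z$
$|z| = r = \sqrt{x^2 + y^2}$: modulus of $z$
$\arg z = \theta$ $(-\pi < \theta \le \pi)$: argument of $z$
$z^* = x - yi = r\,\mathrm{cis}(-\theta)$: complex conjugate of $z$


Probability and statistics

$\mu = E[x] = \sum p_i x_i$: mean or expected value of random variable $x$, distributed with probability $p$
$\sigma^2 = V[x] = E[(x - \mu)^2] = E[x^2] - \mu^2$: variance of random variable $x$
$\sigma$: standard deviation
$G(t)$: probability generator, with $\mu = G'(1)$ and $\sigma^2 = G''(1) + G'(1) - [G'(1)]^2$
$\mu = np$, $\sigma^2 = npq$: mean and variance of binomial distribution
$\Phi(x)$: cumulative probability, or distribution, function
$\phi(x) = \Phi'(x) \Rightarrow \Phi(X) = \int_{-\infty}^{X} \phi(x)\,dx$: probability density function
$\mu = \int x\phi(x)\,dx$, $\sigma^2 = \int (x - \mu)^2\phi(x)\,dx = \int x^2\phi(x)\,dx - \mu^2$: mean and variance of continuous distribution with p.d.f. $\phi(x)$
$\phi(x) = \dfrac{1}{\sigma\sqrt{2\pi}}\,e^{-(x-\mu)^2/2\sigma^2}$: normal distribution with mean $\mu$ and standard deviation $\sigma$
$\phi(x) = \dfrac{1}{\sqrt{2\pi}}\,e^{-x^2/2}$: normal distribution with mean 0 and standard deviation 1
$p_r = e^{-a}\dfrac{a^r}{r!}$: Poisson distribution with mean $a$
$E[m] = \mu$, $V[m] = \dfrac{\sigma^2}{n}$, $\dfrac{\sigma}{\sqrt{n}}$: expected value, variance and standard deviation of distribution of mean $m$ of sample size $n$ (drawn from population with mean $\mu$ and standard deviation $\sigma$)
$E[s^2] = \dfrac{n-1}{n}\sigma^2$: expected value of variance $s^2$ of sample size $n$
$s\sqrt{\dfrac{n}{n-1}}$: unbiased estimate for standard deviation of a population when sample size $n$ has standard deviation $s$

For a bivariate distribution $(x_i, y_i)$:

$\bar{x} = \dfrac{\sum x_i}{n}$, $\bar{y} = \dfrac{\sum y_i}{n}$: means of $x, y$
$s_x^2 = \dfrac{\sum (x_i - \bar{x})^2}{n} = \dfrac{\sum x_i^2}{n} - \bar{x}^2$: variance of $x$
$s_y^2 = \dfrac{\sum (y_i - \bar{y})^2}{n} = \dfrac{\sum y_i^2}{n} - \bar{y}^2$: variance of $y$
$s_{xy} = \dfrac{\sum (x_i - \bar{x})(y_i - \bar{y})}{n} = \dfrac{\sum x_iy_i}{n} - \bar{x}\bar{y}$: covariance of $x, y$
$\dfrac{s_{xy}}{s_x^2}$: regression coefficient of $y$ on $x$
$\dfrac{s_{xy}}{s_y^2}$: regression coefficient of $x$ on $y$
$r = \dfrac{s_{xy}}{s_xs_y}$: correlation coefficient
$\rho = 1 - \dfrac{6\sum d^2}{n(n^2 - 1)}$: Spearman's rank correlation coefficient
$\tau = \dfrac{S}{\tfrac12 n(n - 1)}$: Kendall's rank correlation coefficient


Matrices and determinants

$\mathbf{A}, \mathbf{B}, \mathbf{C}, \dots$: matrices
$\det \mathbf{A}$ or $|\mathbf{A}|$: determinant of square matrix $\mathbf{A}$
transposed matrices
$\mathrm{adj}\,\mathbf{A}, \mathrm{adj}\,\mathbf{B}, \dots$: adjoint matrices
$\mathbf{A}^{-1}, \mathbf{B}^{-1}, \dots$: inverse matrices
$A_{ij}$: cofactors of $a_{ij}$ in $\det \mathbf{A}$
$\mathbf{AB}$: product matrix


Sets

$\mathbb{Z}$: integers
$\mathbb{Q}$: rational numbers
$\mathbb{R}$: real numbers
$\mathbb{C}$: complex numbers
$\mathbb{Z}^+, \mathbb{Q}^+, \mathbb{R}^+$: positive integers etc.
$\mathbb{Z}^*, \mathbb{Q}^*, \mathbb{R}^*$: $\mathbb{Z}, \mathbb{Q}, \mathbb{R}$, but omitting zero


Groups

$(G, *)$: group formed by elements of $G$ under binary operation $*$
$C_n$: cyclic group of order $n$
$D_n$: dihedral group of order $2n$ (group of reflections and rotations of regular $n$-sided polygon)
$S_n$: symmetric (permutation) group of order $n!$
$\cong$: is isomorphic to


Boolean Algebra

$A + B$ ($= A \cup B$): union of sets $A, B$
$AB$ ($= A \cap B$): intersection of sets $A, B$
$\subset, \supset$: inclusion ($A \subset B$ or $A \supset B$)
$\emptyset$: null (or empty) set
$\mathscr{E}$: universal set
$A'$: complementary set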

Throughout the book, and in obedience to the recommendations of the 
British Standards Institution and the Royal Society (though contrary to 
the preference of F.J.B.), the aligned dot denotes a decimal point and not
multiplication: so that 2.3, for instance, means $2\tfrac{3}{10}$ rather than 6.


Further calculus 


9.1 The logarithmic function, ln x


When we first considered integration, we found that 


$$\int t^n\,dt = \frac{t^{n+1}}{n+1} + c, \quad \text{provided that } n \neq -1.$$

When $n = -1$, however, this expression contains the meaningless term
$t^0/0$. Nevertheless, the problem of finding $\int t^{-1}\,dt$ is certainly not meaningless,
and will arise whenever (which is frequently) we need to know the
area beneath a section of the curve $f(t) = 1/t$. But the reader will probably
find that, despite all his efforts, he fails to discover a function whose derivative
is $1/t$, and must therefore return to a more careful examination of the
area which this integral represents.

We start by considering the area beneath the curve $f(t) = 1/t$ between
$t = 1$ and $t = x$, which we call $l(x)$.

[Figure: the curve $f(t) = 1/t$, with the area $l(x)$ between $t = 1$ and $t = x$]

So $$l(x) = \int_1^x \frac{dt}{t},$$ which is defined for all $x > 0$.

Clearly, if $x > 1$, $l(x) > 0$;
if $0 < x < 1$, $l(x) < 0$;
and $l(1) = 0$.

We can now estimate $l(x)$ for any positive value of $x$ by means of counting
squares or by Simpson's rule.
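The estimate by Simpson's rule can be sketched in a few lines of Python. This is only an illustration: the function names `simpson` and `l` are chosen here for convenience and do not come from the text, and the default of four intervals simply matches the exercise that follows.

```python
def simpson(f, a, b, n=4):
    """Composite Simpson's rule on [a, b] with n equal intervals (n even)."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        # odd-numbered ordinates carry weight 4, even-numbered ones weight 2
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3


def l(x, n=4):
    """Estimate of the area beneath 1/t between t = 1 and t = x."""
    return simpson(lambda t: 1.0 / t, 1.0, x, n)


for x in (2, 3, 6):
    print(x, round(l(x), 4), round(l(x, 100), 4))
```

Comparing the four-interval values with those for 100 intervals gives a feel for how the accuracy of the rule falls off as the range of integration widens.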


Exercise 9.1a 


1 Count squares or use Simpson's rule (with four intervals in each case)
to estimate the values of:


(i) $l(2)$; (ii) $l(3)$; (iii) $l(6)$; (iv) $l(9)$; (v) $l(12)$; (vi) $l(\tfrac12)$; (vii) $l(\tfrac13)$.


Properties of $l(x)$
After this last exercise, the reader may have suspected that


$$l(6) = l(2) + l(3), \quad l(12) = l(2) + l(6),$$
$$l(3^2) = 2\,l(3), \quad l(2^{-1}) = -l(2).$$


More generally, we shall now prove that


(i) $l(xy) = l(x) + l(y)$,
and (ii) $l(x^n) = n\,l(x)$.


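(i) By definition,

$$l(x) = \int_1^x \frac{1}{t}\,dt \quad \text{and} \quad l(y) = \int_1^y \frac{1}{t}\,dt,$$

and we can transform the second integral by changing the variable.
Regarding $x$ as constant, we let $u = xt$ so that $du = x\,dt$.


Also $t = 1 \Rightarrow u = x$, and $t = y \Rightarrow u = xy$.


So $$\int_1^y \frac{1}{t}\,dt = \int_x^{xy} \frac{1}{u}\,du = \int_x^{xy} \frac{1}{t}\,dt$$


(as the letters $t$ and $u$ are themselves irrelevant).


Hence $$\int_1^{xy} \frac{1}{t}\,dt = \int_1^x \frac{1}{t}\,dt + \int_x^{xy} \frac{1}{t}\,dt = \int_1^x \frac{1}{t}\,dt + \int_1^y \frac{1}{t}\,dt$$

$$\Rightarrow \quad l(xy) = l(x) + l(y).$$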


This substitution can be illustrated by considering the shaded area 
between ¢ = 1 and ¢ = y and squeezing it, like toothpaste, into the area 
between ¢ = x and ¢ = xy: horizontal dimensions are stretched by a 
factor x and vertical dimensions compressed by a factor 1/x, so that areas 
remain unchanged: 


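[Figure: the curve $f(t) = 1/t$, showing the equal areas between $t = 1$, $t = y$ and between $t = x$, $t = xy$]


(ii) Similarly, $$l(x^n) = \int_1^{x^n} \frac{1}{t}\,dt;$$


and if we now let $t = u^n$, it follows that $dt = nu^{n-1}\,du$.


Also $t = 1 \Rightarrow u = 1$ and $t = x^n \Rightarrow u = x$.


So $$l(x^n) = \int_1^x \frac{nu^{n-1}}{u^n}\,du = n\int_1^x \frac{1}{u}\,du = n\,l(x).$$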


Now as these are also properties of logarithms, the question immediately
arises whether $l(x)$ itself is in fact a logarithm. But first we must introduce
the crucial number $e$ such that

$$l(e) = \int_1^e \frac{dt}{t} = 1,$$

i.e., such that $t = e$ is the ordinate up to which there is unit area beneath the
given curve.


By counting squares, we can show that $e \approx 2.72$, and we shall soon be
able to calculate it to a much greater degree of accuracy.
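As a rough numerical counterpart to counting squares, the sketch below searches for the ordinate up to which the area beneath $1/t$ is exactly one unit. It assumes Simpson's rule for the area and a simple bisection search between 2 and 3; the names `area` and `estimate_e` are illustrative only.

```python
def area(x, n=200):
    """Simpson estimate (n even) of the area beneath 1/t from t = 1 to t = x."""
    h = (x - 1.0) / n
    total = 1.0 + 1.0 / x
    for k in range(1, n):
        total += (4 if k % 2 else 2) / (1.0 + k * h)
    return total * h / 3


def estimate_e(lo=2.0, hi=3.0, tol=1e-9):
    """Bisection: the area is below 1 at lo and above 1 at hi."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if area(mid) < 1.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


e_est = estimate_e()
print(e_est)
```

The bracket [2, 3] is justified by the areas to 2 and to 3 being respectively less than and greater than 1.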
Using the above results, we see that


$$l(e^{l(x)}) = l(x)\,l(e) = l(x).$$


So the areas under the curve to the value $e^{l(x)}$ and to the value $x$ are equal.
These values must therefore be the same, and $e^{l(x)} = x$


$$\Rightarrow \quad l(x) = \log_e x.$$


$\log_e x$ is usually called a natural, or Naperian, logarithm (after John
Napier, 1550-1617) and is sometimes denoted without a base, simply as
$\log x$; but more customarily as $\ln x$. We shall also usually write


$$\int \frac{1}{t}\,dt \ \text{ and } \ \int \frac{1}{x}\,dx \quad \text{as} \quad \int \frac{dt}{t} \ \text{ and } \ \int \frac{dx}{x}.$$

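So, in summary, if $x > 0$:

$$\ln x = \int_1^x \frac{dt}{t}, \qquad \frac{d}{dx}(\ln x) = \frac{1}{x}, \qquad \int \frac{dx}{x} = \ln x + c,$$

and the relationship between $\dfrac{1}{x}$ and $\ln x$ can be conveniently illustrated by their graphs.

[Figure: graphs of $1/x$ and $\ln x$]


In particular, we notice that


when $x \le 0$, $\ln x$ is not defined;
as $x \to +\infty$, $\ln x \to +\infty$;
as $x \to 0+$, $\ln x \to -\infty$.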


We are now able to find other areas bounded by such graphs. 


Example 1 


Investigate the areas represented by


(i) $\int_2^3 \dfrac{dx}{x}$; (ii) $\int_{-3}^{-2} \dfrac{dx}{x}$; (iii) $\int_{-2}^{2} \dfrac{dx}{x}$.


(i) $$\int_2^3 \frac{dx}{x} = [\ln x]_2^3 = \ln 3 - \ln 2 = \ln 1.5 \approx 0.41.$$

(ii) $\int_{-3}^{-2} \dfrac{dx}{x}$ represents an area below the $x$-axis, and our natural approach
would be similar:

$$\int_{-3}^{-2} \frac{dx}{x} = [\ln x]_{-3}^{-2} = \ln(-2) - \ln(-3).$$

But the reader will rightly object that $\ln(-2) - \ln(-3)$ is completely
meaningless, since neither $\ln(-2)$ nor $\ln(-3)$ has been defined. We
can, however, easily side-step this obstacle by using the symmetry of the
graph about the origin, observing that


$$\int_{-3}^{-2} \frac{dx}{x} = -\int_2^3 \frac{dx}{x} = -[\ln x]_2^3 = -(\ln 3 - \ln 2) = -\ln 1.5 \approx -0.41.$$

(iii) $\int_{-2}^{2} \dfrac{dx}{x}$

[Figure: graph of $f(x) = 1/x$ between $x = -2$ and $x = 2$]


We have already seen that $\ln x \to -\infty$ as $x \to 0+$, so the area beneath
the curve between $x = 0$ and $x = 2$ is infinite; and similarly the area to
the left of $x = 0$ is infinite.

So $\int_{-2}^{2} dx/x$ is composed of two parts, both of which are infinite. It is
therefore meaningless and without value.
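
Example 2
Differentiate:

(i) $\ln 5x^3$; (ii) $\ln\dfrac{x^2+1}{x}$; (iii) $\log_{10} x$.


(i) $\ln 5x^3 = \ln 5 + \ln x^3 = \ln 5 + 3\ln x$

$$\Rightarrow \frac{d}{dx}(\ln 5x^3) = \frac{d}{dx}(\ln 5 + 3\ln x) = \frac{3}{x}.$$

Or $$\frac{d}{dx}(\ln 5x^3) = \frac{1}{5x^3} \times 15x^2 = \frac{3}{x}.$$

(ii) $$\frac{d}{dx}\ln\frac{x^2+1}{x} = \frac{d}{dx}[\ln(x^2+1) - \ln x] = \frac{2x}{x^2+1} - \frac{1}{x} = \frac{x^2-1}{x(x^2+1)}.$$

Or $$\frac{d}{dx}\ln\frac{x^2+1}{x} = \frac{x}{x^2+1} \times \frac{x \times 2x - (x^2+1)}{x^2} = \frac{x^2-1}{x(x^2+1)}.$$

(iii) Let $y = \log_{10} x$.
Then $x = 10^y$
$\Rightarrow \ln x = y\ln 10$
$\Rightarrow y = \dfrac{\ln x}{\ln 10}$
$\Rightarrow \dfrac{dy}{dx} = \dfrac{1}{(\ln 10)x} = \dfrac{1}{(2.302\,6)x}$.

So $\dfrac{d}{dx}(\log_{10} x) = \dfrac{0.434\,3}{x}$.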
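Example 3

Find: (i) $\int \dfrac{x^2}{x^3+1}\,dx$; (ii) $\int \dfrac{f'(x)}{f(x)}\,dx$; (iii) $\int_0^{\pi/4} \tan x\,dx$.


(i) Let $I = \int \dfrac{x^2}{x^3+1}\,dx$.

Now $u = x^3 + 1 \Rightarrow du = 3x^2\,dx$.

So $$I = \int \tfrac13\,\frac{du}{u} = \tfrac13\ln u + c$$

$$\Rightarrow \int \frac{x^2}{x^3+1}\,dx = \tfrac13\ln(x^3+1) + c.$$

(ii) Here again we let $u = f(x) \Rightarrow du = f'(x)\,dx$.

So $$J = \int \frac{f'(x)}{f(x)}\,dx = \int \frac{du}{u} = \ln u + c = \ln f(x) + c,$$

so that we are now able to integrate any quotient whose numerator is the
derivative of its denominator.


(iii) $I = \int_0^{\pi/4} \tan x\,dx$.

This can be expressed as

$$\int_0^{\pi/4} \frac{\sin x}{\cos x}\,dx = -\int_0^{\pi/4} \frac{-\sin x}{\cos x}\,dx.$$

So, using (ii) above,

$$I = [-\ln\cos x]_0^{\pi/4} = -\ln\frac{1}{\sqrt2} + \ln 1 = \tfrac12\ln 2.$$

Hence $\int_0^{\pi/4} \tan x\,dx = \tfrac12\ln 2$.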


0 


Exercise 9.1b


1 Given that ln 2 = 0.693 15, ln 3 = 1.098 61, ln 5 = 1.609 44, calculate
to 5 places of decimals:


(1) In4; (ἢ) In6; (az) In 10; (0) In 12: (0) Ing; 
(vi) Int; (vii) Indy; (vit) In 105; (éx) In. 2; (x) In ¥/100. 


2 Evaluate: 


(i) Ine?; (ii) In 1; (a) In =; (iv) In Je. 


3 Solve the equations: 
(i) nx =2; (i) Inx = 100; (11) ἰὴ χα = τς; (w) nx = —2. 




4 Differentiate with respect to «x: 


(1) In 2x; (1) Inx; (12) In 3x*; 
] 
(1) In x (v) In τ’ (vi) In ./x; 


Ι 
Cais, “Grin = τ tae 
x 
(x) In x/ (x + 1); (x1) xInx; (x72) ume 
x 


5 Differentiate with respect to $x$:

(i) $\ln\sin x$; (ii) $\ln\cos x$; (iii) $\ln\sec x$;
(iv) $\ln(\sec x - \tan x)$; (v) $\ln(3\sin x)$; (vi) $\ln(\sin 3x)$;
(vii) $\ln(\sin^2 x)$; (viii) $\ln(4\sin^2 x)$; (ix) $(\ln x)^2$;
(x) $\ln(\ln x)$.


6 Find the following integrals: 


dx ο, dx 

(1) I (11) "τὸ τε 53 (222 ) [τς yar 3 

, 2x dx 

a of om 15. 
x ae x? + ] 

τ ΟἹ Pete rs ὦ |e αὶ 
cot x dx; i) | tan 3x dx; (x27 ) ) | cot 3x dx; 


sin xX — cosx + tan x 


ὩΣ a fastens, χ: it) [as 


7 Evaluate (or, if this is impossible, illustrate) : 


dx ᾿ 6 dx . 3. dx 
ofS ὦ | il | sy 


3 dx 3 dx 3 x dx 
(20) \ Rae 4) (v) Ϊ τς; Ὁ" (v2 ) Ϊ σα, 


8 Given that ln 2 = 0.693 15, use differentiation to estimate (to 5 places
of decimals):
(i) ln 2.001; (ii) ln 1.999 9.


9 Find the volumes of the solids of revolution formed when the area
(1) between y = Lal % x = 1, x = 2, and y = 0 is rotated about the 
x-axis} 


(ii) between $y = 1/x^2$, $y = 1$, $y = 4$ is rotated about the $y$-axis.


10 By using the Newton—Raphson method, or any other suitable iterative 
method, solve the equation 


$$x + \ln x = 3,$$


given that the root is close to 2. Obtain the root correct to two decimal 
places. (J.M.B.) 


11 (i) Using the Newton-Raphson method or otherwise, solve the
equation 

Inx = $x, 

giving your answers to four significant figures. 


(ii) Find the gradient of the straight line through the origin which touches
the graph of $\ln x$, and hence show that


$$\ln x = \alpha x$$
has a solution whenever $\alpha < 1/e$.


12 Boyle's law states that, at constant temperature, the pressure $p$ and
volume $V$ of a gas are such that $pV = c$, where $c$ is a constant. If the work
done by the gas when expanding is $\int p\,dV$, show that this is constant
($= c\ln 2$) whenever it doubles its volume.


14 If an animal grows without changing its shape then its length $x$, surface
area $S$ and volume $V$ are such that


$S = ax^2$, $V = bx^3$ (where $a, b$ are constants).


Hence show by differentiation: (a) without use of logarithms; (b) after
first taking logarithms, that slight increases $\delta x$, $\delta S$ and $\delta V$ are related by

$$\frac{\delta S}{S} \approx 2\,\frac{\delta x}{x}, \qquad \frac{\delta V}{V} \approx 3\,\frac{\delta x}{x},$$

i.e., that fractional increases of $S$ (and $V$) are twice (and thrice) the fractional
increase of $x$.


9.2 The exponential function, $e^x$


We now proceed to investigate the function $f(x) = e^x$. This is clearly a
power function, and we start by finding its derivative.


Now $y = e^x$
$\Rightarrow x = \ln y$
$\Rightarrow \dfrac{dx}{dy} = \dfrac{1}{y}$
$\Rightarrow \dfrac{dy}{dx} = y = e^x$.

So $$\frac{d}{dx}(e^x) = e^x$$

and we immediately see that $e^x$ is the particular power function whose rate
of increase is always equal to its value.


It was mentioned earlier that power functions are sometimes referred to
as exponential functions. Because of the supreme importance of $e^x$ it is
usually called the exponential function, and is sometimes written $\exp x$.

Similarly we see that

$$\frac{d}{dx}(e^{-x}) = e^{-x} \times -1 = -e^{-x}$$

so that $e^{-x}$ is the corresponding function of exponential decay.
These two functions, and variants of them, occur whenever the rate of 
growth of a quantity, or its rate of decay, is determined by its present 
amount. So, for instance, the spread of an infection in the first stage of an
epidemic is likely to involve a positive exponential function, whilst the 
decay of a radioactive element is governed by a negative exponential. 


Lastly, just as $\dfrac{d}{dx}(e^x) = e^x$,
so $\int e^x\,dx = e^x + c$.
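
Example 1
Differentiate: (i) $e^{x^2}$; (ii) $10^x$.

(i) $$\frac{d}{dx}(e^{x^2}) = e^{x^2} \times 2x = 2xe^{x^2}.$$

(ii) $$\frac{d}{dx}(10^x) = \frac{d}{dx}(e^{\ln 10})^x = \frac{d}{dx}(e^{x\ln 10}) = e^{x\ln 10} \times \ln 10 = 10^x\ln 10.$$

Alternatively, $y = 10^x$
$\Rightarrow \ln y = \ln 10^x = x\ln 10$
$\Rightarrow \dfrac{1}{y}\dfrac{dy}{dx} = \ln 10$
$\Rightarrow \dfrac{dy}{dx} = y\ln 10$
$\Rightarrow \dfrac{d}{dx}(10^x) = 10^x\ln 10$.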
Example 2

Find the integrals (i) $\int_0^1 e^{2x}\,dx$; (ii) $\int 2^x\,dx$.


(i) Since $\dfrac{d}{dx}(e^{2x}) = 2e^{2x}$,
it immediately follows that

$$\int_0^1 e^{2x}\,dx = \left[\tfrac12 e^{2x}\right]_0^1 = \tfrac12(e^2 - 1).$$


(ii) As in Example 1, we can easily show that

$$\frac{d}{dx}(2^x) = 2^x\ln 2.$$

So $$\int 2^x\,dx = \frac{2^x}{\ln 2} + c.$$


Example 3 


When a charged condenser is discharged through a particular circuit containing
an inductance and resistance, the current $x$ at time $t$ obeys the
equation

$$\frac{d^2x}{dt^2} + 2\frac{dx}{dt} + 5x = 0.$$

Show that $x = e^{-t}\cos 2t$ satisfies this equation and sketch the graph of the
function.


If $x = e^{-t}\cos 2t$,

then $$\frac{dx}{dt} = e^{-t} \times (-2\sin 2t) - e^{-t}\cos 2t = e^{-t}(-\cos 2t - 2\sin 2t),$$

and $$\frac{d^2x}{dt^2} = e^{-t}(2\sin 2t - 4\cos 2t) - e^{-t}(-\cos 2t - 2\sin 2t) = e^{-t}(-3\cos 2t + 4\sin 2t).$$


So we verify that

$$\frac{d^2x}{dt^2} + 2\frac{dx}{dt} + 5x = e^{-t}\{(-3\cos 2t + 4\sin 2t) + 2(-\cos 2t - 2\sin 2t) + 5\cos 2t\}$$

$= 0.$
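The same verification can be checked numerically. The sketch below takes the equation to be $\ddot{x} + 2\dot{x} + 5x = 0$ (the form implied by the coefficients 1, 2, 5 in the verification above) and approximates the derivatives of $x = e^{-t}\cos 2t$ by central differences; the step `h` and the sample times are arbitrary choices made for illustration.

```python
import math


def x(t):
    """The proposed solution x = e^(-t) cos 2t."""
    return math.exp(-t) * math.cos(2 * t)


def residual(t, h=1e-4):
    """Left-hand side x'' + 2x' + 5x, with derivatives by central differences."""
    dx = (x(t + h) - x(t - h)) / (2 * h)
    d2x = (x(t + h) - 2 * x(t) + x(t - h)) / h ** 2
    return d2x + 2 * dx + 5 * x(t)


# largest residual over t = 0, 0.1, ..., 4.9; should be zero up to
# the small error of the finite-difference approximation
worst = max(abs(residual(0.1 * k)) for k in range(50))
print(worst)
```

A residual that is tiny but not exactly zero is to be expected, since finite differences only approximate the derivatives.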


Now $x = e^{-t}\cos 2t$ is the product of an exponential decay and a cosine
function. Combining these, we obtain the graph of $x = e^{-t}\cos 2t$ and we readily see
that it represents a decaying oscillation.


Example 4 


A bathful of water is at a temperature of 80 °C in a large room whose tem- 
perature is 20 °C. The water cools at a rate which is proportional to its 
excess temperature and takes 5 min to reach 70 °C. 

Find the temperature of the water after ¢ min. 


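If the excess temperature is $\theta$ °C, the rate of cooling is $-\dfrac{d\theta}{dt}$.

So $\dfrac{d\theta}{dt} = -k\theta$, where $k$ is a constant,
$\Rightarrow \dfrac{dt}{d\theta} = -\dfrac{1}{k\theta}$
$\Rightarrow t = -\dfrac{1}{k}\ln\theta + c$.


But, when $t = 0$, $\theta = 60$.


So $0 = -\dfrac{1}{k}\ln 60 + c \Rightarrow c = \dfrac{1}{k}\ln 60$
$\Rightarrow t = -\dfrac{1}{k}\ln\theta + \dfrac{1}{k}\ln 60 = -\dfrac{1}{k}\ln\dfrac{\theta}{60}$
$\Rightarrow \ln\dfrac{\theta}{60} = -kt$.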
So far we have only used the general 'law of cooling' and the 'initial'
condition that $\theta = 60$ when $t = 0$. But we also know that when $t = 5$,
$\theta = 50$.


So $\ln\tfrac56 = -5k$
$\Rightarrow k = -\tfrac15\ln\tfrac56 \approx 0.0365$.


So $\ln\dfrac{\theta}{60} = -0.0365t$
$\Rightarrow \dfrac{\theta}{60} = e^{-0.0365t}$
$\Rightarrow \theta = 60e^{-0.0365t}$.


So the temperature of the bath after $t$ minutes will be
$20 + 60e^{-0.0365t}$.


[Figure: graph of temperature/°C against t/min]


We can now use this function to calculate the temperature of the bath at
any subsequent time. For example:


if $t = 10$, temperature $= 20 + 60e^{-0.365}$
$= 20 + 60 \times 0.694$
$= 20 + 41.6 = 61.6$ °C;
if $t = 20$, temperature $= 20 + 60e^{-0.73}$
$= 20 + 60 \times 0.482$
$= 20 + 28.9 = 48.9$ °C.
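The cooling calculation can be reproduced in a few lines. This is a minimal sketch of the example above: the names `ROOM`, `EXCESS0` and `temperature` are illustrative, and $k$ is kept unrounded rather than taken as 0.0365.

```python
import math

ROOM = 20.0      # room temperature in deg C
EXCESS0 = 60.0   # initial excess temperature, 80 - 20

# the excess falls from 60 to 50 in 5 min, so ln(50/60) = -5k
k = -math.log(50.0 / 60.0) / 5.0


def temperature(t):
    """Temperature of the bath (deg C) after t minutes."""
    return ROOM + EXCESS0 * math.exp(-k * t)


print(round(k, 4))
for t in (0, 5, 10, 20):
    print(t, round(temperature(t), 1))
```

Because $k$ is not rounded here, the value at $t = 10$ comes out as about 61.7 °C rather than the 61.6 °C obtained above with three-figure working.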


Exercise 9.2 


1 Use tables to evaluate e* when x = 10, 100, τς, —10. 


2 Find values of x such that 
(1) οἴ = 10, 1000, 10°; (1) e * τ τίς, 1072. 107°. 
3 Differentiate with respect to $x$:


(2). ἐς: (ii) 5»; (111) ae ὅ; 

4 sete 

Gio) So) PS (WH) 

(vii) e “sin x; (vit) e** cos 3x; (ix) 3%; Came ae 


4 Find the following integrals: 


(i) [πὰ (i) te dx; (iit ) | seas 
(τυ) [που αι (υ) | ret as (v2 ) [ 10 as. 


5 Evaluate the following integrals: 


2 3 1 2 
(1) | e** dx; (i) | e°* dx; ai) | xe?’ dx; (iv) | 10* dx. 


0 1 0 0 


6 Investigate the stationary values and sketch the graphs of the following 
functions: 
(i) xe7™; (ὦ) x2 e7*; (iti) χ' δ; (ὦ) 9". 


7 Find $\int_0^X e^{-x}\,dx$.
By letting $X \to +\infty$, investigate the area which lies between the curve
$y = e^{-x}$ and the two axes (for convenience called $\int_0^\infty e^{-x}\,dx$).


8 After time $t$ the displacement $x$ of a heavily damped pendulum is given
by $x = 2e^{-t}\sin t$.

(i) For what values of $t$ is the displacement zero?

(ii) For what values of $t$ is the displacement greatest?

(iii) Sketch the graph of $x$ against $t$.

(iv) Prove that

$$\frac{d^2x}{dt^2} + 2\frac{dx}{dt} + 2x = 0.$$


9 A biologist knows that a certain organism, of which there is initially
2 g, is growing continuously at a rate of $\tfrac{1}{10}$ g/day for each gram of its mass.
If its mass after $t$ days is $m$ g,

(i) show that $\dfrac{dm}{dt} = \dfrac{1}{10}m$;

(ii) deduce that $m = 2e^{t/10}$;

(iii) by what percentage does it increase each day?

(iv) how long does it take to double its mass?
10 A sample of radium loses mass at a rate which is proportional to the
amount present.
(i) If its mass is $m$ after time $t$, show that

$$\frac{dm}{dt} = -km;$$

(ii) if its initial mass (when $t = 0$) is $m_0$, deduce that
$$m = m_0e^{-kt};$$

(iii) if its mass is halved in 1 600 years, calculate $k$;

(iv) calculate the percentage rate of decrease per century.


11 A man with toothache leaves home at noon for the dentist, whose
surgery is 1 km away. He starts by running at 10 km h$^{-1}$, but gradually
slows down so that his speed is always proportional to his distance from the
surgery.

(i) How far has he gone in the first 5 min?

(ii) How long will he take to get within 100 m?

(iii) When will he arrive?


9.3 Interlude: infinite series


In Chapter 6 we investigated a number of finite series and also mentioned 
the ‘sum to infinity’ of a geometric progression. 

It was proved, for instance, that if $S_n$ is the sum of the first $n$ terms of the
series

$$1 + \tfrac12 + \tfrac14 + \tfrac18 + \cdots,$$

then $$S_n = 2 - \left(\tfrac12\right)^{n-1}.$$

So $S_n \to 2$,
and we say that the infinite series
$$1 + \tfrac12 + \tfrac14 + \tfrac18 + \tfrac1{16} + \cdots$$

is convergent, and that its sum (or sum to infinity) is 2.
More generally, we can consider the infinite series

$$u_1 + u_2 + u_3 + \cdots$$
and suppose that the sum of its first $n$ terms is $S_n$, so that

$$S_n = u_1 + u_2 + u_3 + \cdots + u_n.$$

If $S_n \to S$ as $n \to \infty$, we say that the infinite series is convergent, and that
$S$ is its sum.

It is beyond our present scope to investigate the convergence of series, 
but we can consider some elementary examples: 


Example 1

$$1 + \tfrac12 + \tfrac13 + \tfrac14 + \cdots$$

Certainly the terms of this series become very small. If we look at the
successive values of $S_n$, we obtain (approximately):


1, 1.500, 1.833, 2.083, 2.283,


and before long it seems that the values of $S_n$ are having such a struggle to
increase that they must necessarily tend to a limit, which will be the sum
of the infinite series. But a little more thought will cause us to write the
series as

$$1 + \tfrac12 + \left(\tfrac13 + \tfrac14\right) + \left(\tfrac15 + \tfrac16 + \tfrac17 + \tfrac18\right) + \left(\tfrac19 + \cdots + \tfrac1{16}\right) + \cdots$$
$$> 1 + \tfrac12 + 2 \times \tfrac14 + 4 \times \tfrac18 + 8 \times \tfrac1{16} + \cdots$$
$$= 1 + \tfrac12 + \tfrac12 + \tfrac12 + \tfrac12 + \cdots$$


Now this series, although growing very slowly, is completely unlimited. So,
contrary to our expectations, the original series
$$1 + \tfrac12 + \tfrac13 + \tfrac14 + \cdots$$

is also unlimited and does not converge (even though, as we shall later
prove, the sum of its first million terms is less than 15).
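The slow growth of these partial sums is easy to observe directly. The following sketch simply adds the terms one by one; the function name `harmonic` is an illustrative choice, and the loop to a million terms takes well under a second on an ordinary machine.

```python
def harmonic(n):
    """Sum of the first n terms of 1 + 1/2 + 1/3 + ..."""
    total = 0.0
    for k in range(1, n + 1):
        total += 1.0 / k
    return total


for n in (5, 100, 10_000, 1_000_000):
    print(n, round(harmonic(n), 3))
```

Each hundredfold increase in the number of terms adds only about 4.6 to the sum, which is why the divergence is so hard to detect by inspection.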


Example 2 


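The series

$$\frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \frac{1}{5^2} + \frac{1}{6^2} + \frac{1}{7^2} + \cdots$$

can be written as

$$1 + \left(\frac{1}{2^2} + \frac{1}{3^2}\right) + \left(\frac{1}{4^2} + \frac{1}{5^2} + \frac{1}{6^2} + \frac{1}{7^2}\right) + \left(\frac{1}{8^2} + \cdots + \frac{1}{15^2}\right) + \cdots$$
$$< 1 + 2 \times \frac{1}{2^2} + 4 \times \frac{1}{4^2} + 8 \times \frac{1}{8^2} + \cdots$$
$$= 1 + \tfrac12 + \tfrac14 + \tfrac18 + \cdots = 2.$$

In this case, therefore, the series

$$\frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \cdots$$

does converge to a sum less than 2 (and actually, though we cannot prove it
here, to $\tfrac16\pi^2$).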
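A quick numerical experiment supports both statements, that the partial sums stay below 2 and that they approach $\tfrac16\pi^2$. The sketch below is illustrative only; `partial` and `limit` are names chosen here.

```python
import math

limit = math.pi ** 2 / 6   # the value quoted (without proof) in the text


def partial(n):
    """Sum of the first n terms of 1/1^2 + 1/2^2 + 1/3^2 + ..."""
    return sum(1.0 / k ** 2 for k in range(1, n + 1))


for n in (10, 100, 1000):
    s = partial(n)
    print(n, round(s, 6), round(limit - s, 6))
```

The printed differences shrink roughly like $1/n$, so convergence here is genuine but not fast.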




Example 3 


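As a warning that we must treat infinite series with a certain respect, we
now consider the series

$$1 - \tfrac12 + \tfrac13 - \tfrac14 + \tfrac15 - \cdots$$

It can be shown that this series is convergent and also, perhaps surprisingly,
that its sum is $\ln 2 \approx 0.693$. So it would appear to be an innocent
operation to rearrange its terms as follows:

$$1 - \tfrac12 - \tfrac14 + \tfrac13 - \tfrac16 - \tfrac18 + \tfrac15 - \tfrac1{10} - \tfrac1{12} + \cdots$$
$$= \left(1 - \tfrac12\right) - \tfrac14 + \left(\tfrac13 - \tfrac16\right) - \tfrac18 + \left(\tfrac15 - \tfrac1{10}\right) - \tfrac1{12} + \cdots$$
$$= \tfrac12 - \tfrac14 + \tfrac16 - \tfrac18 + \tfrac1{10} - \tfrac1{12} + \cdots$$
$$= \tfrac12\left(1 - \tfrac12 + \tfrac13 - \tfrac14 + \tfrac15 - \cdots\right).$$

But the reader will immediately see that we have 'proved' that
$$\ln 2 = \tfrac12\ln 2.$$


This paradox, and others like it, show that the theory of infinite series
must be developed with very great care: though we cannot attempt this
here, it forms a major branch of modern mathematics.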
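The effect of rearranging can be seen numerically. The sketch below assumes the familiar rearrangement in which each positive term is followed by two negative ones, $1 - \tfrac12 - \tfrac14 + \tfrac13 - \tfrac16 - \tfrac18 + \cdots$; the function names are illustrative.

```python
import math


def alternating(n_terms):
    """Partial sum of 1 - 1/2 + 1/3 - 1/4 + ... in its original order."""
    return sum((-1) ** (k + 1) / k for k in range(1, n_terms + 1))


def rearranged(n_groups):
    """Partial sum of 1 - 1/2 - 1/4 + 1/3 - 1/6 - 1/8 + ..., taken in
    groups of one positive term followed by two negative terms."""
    total = 0.0
    for m in range(1, n_groups + 1):
        total += 1.0 / (2 * m - 1) - 1.0 / (4 * m - 2) - 1.0 / (4 * m)
    return total


print(alternating(200_000), math.log(2))
print(rearranged(100_000), 0.5 * math.log(2))
```

Both series contain exactly the same terms, yet the partial sums settle near different values: the order of summation matters for a series of this kind.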


Example 4 
We now return to the geometric series
$$1 + x + x^2 + x^3 + \cdots$$

The reader will recall that

$$S_n = \frac{1 - x^n}{1 - x} = \frac{1}{1 - x} - \frac{x^n}{1 - x}$$

and that if $|x| < 1$, $x^n \to 0$ as $n \to \infty$.


So if $|x| < 1$ the series is convergent to the sum

$$\frac{1}{1 - x}.$$

This is a particular case of a power series, whose terms are successive 
powers of x and which converges for a set of values of x called its domain of 
convergence. 


The general power series can be written
$$a_0 + a_1x + a_2x^2 + a_3x^3 + \cdots$$

where $a_0, a_1, a_2, \dots$ are particular constants, and we should expect such
a series, if convergent, to have a sum that is a function of $x$. It will, for
instance, be shown in the next section that

$$x - \tfrac12x^2 + \tfrac13x^3 - \tfrac14x^4 + \cdots$$
is convergent to the sum $\ln(1 + x)$, provided that $-1 < x \le 1$;

and that

$$1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots$$

is convergent to the sum $e^x$ for all values of $x$.


9.4 Maclaurin series


We shall begin by reversing the question of the last section and ask whether, 
if we are given a particular function f (x), we can discover a convergent 
power series of which it is the sum. For this purpose we shall have to make a 
number of assumptions. 

Firstly we shall restrict ourselves to functions which are differentiable 
not only once, but any number of times, i.e. whose derived curves are all 
continuous and smooth. 

Secondly, we shall assume the existence of such a series for a certain 
domain of x, and our problem will be simply to find the series, granted that 
it exists. This may seem rather strange but, throughout mathematics, proving
such existence is frequently a major problem, and the only statement
we shall be able to make here is that, if a particular function is expressible 
as a power series, then this series must be such and such. 

Lastly, we assume the fact that if this series exists and is convergent for a 
certain domain of x, then, within this domain, it is differentiable ‘term by 
term’. By this we mean simply that the derivative of its sum is equal to the 
sum of its derivatives, even though there is an infinite series of them. In 
other words, that just as 


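$$\frac{d}{dx}(u + v) = \frac{du}{dx} + \frac{dv}{dx}$$
and $$\frac{d}{dx}(u + v + w) = \frac{du}{dx} + \frac{dv}{dx} + \frac{dw}{dx}$$


and so on for any finite series, so also it is true for our infinite series that

$$\frac{d}{dx}(u + v + w + \cdots) = \frac{du}{dx} + \frac{dv}{dx} + \frac{dw}{dx} + \cdots.$$
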
Let us suppose that $f(x)$ is a function which, throughout a certain
domain including $x = 0$,
(i) is differentiable any number of times, and
(ii) is the sum of a convergent power series.


Let this series be

$$f(x) = a_0 + a_1x + a_2x^2 + a_3x^3 + a_4x^4 + \cdots$$

Differentiating term by term and putting $x = 0$, we obtain successively:

$f(x) = a_0 + a_1x + a_2x^2 + a_3x^3 + a_4x^4 + \cdots \Rightarrow f(0) = a_0$

$f'(x) = a_1 + 2a_2x + 3a_3x^2 + 4a_4x^3 + \cdots \Rightarrow f'(0) = a_1$

$f''(x) = 2a_2 + 6a_3x + 12a_4x^2 + \cdots \Rightarrow f''(0) = 2!\,a_2$

$f'''(x) = 6a_3 + 24a_4x + \cdots \Rightarrow f'''(0) = 3!\,a_3$

$f^{(4)}(x) = 24a_4 + \cdots \Rightarrow f^{(4)}(0) = 4!\,a_4$, etc.

Hence $$f(x) = f(0) + f'(0)\,x + \frac{f''(0)}{2!}x^2 + \frac{f'''(0)}{3!}x^3 + \cdots + \frac{f^{(n)}(0)}{n!}x^n + \cdots$$

This is known as Maclaurin's series (after the Scots mathematician Colin
Maclaurin, 1698-1746), which we can now illustrate by three very important
examples:


Exponential series 


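If $f(x) = e^x$
then $f'(x) = f''(x) = \cdots = f^{(n)}(x) = e^x$,

so that $f(0) = f'(0) = f''(0) = \cdots = 1$ and

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!} + \cdots$$

It can be proved that this series is valid for all values of $x$; as, indeed, we
would expect, since however large $x$ might be (say $x = 1\,000$), the great
size of $x^n$ is eventually overcome by the even greater size of $n!$.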

We can now use this series to calculate $e$ to any required degree of
accuracy. For, putting $x = 1$,

$$e = 1 + 1 + \frac{1}{2!} + \frac{1}{3!} + \frac{1}{4!} + \cdots$$

and the calculation can be set out to seven places of decimals, each term
being obtained by dividing the one before it by 1, 2, 3, ... in turn:

1        1.000 000 0
1/1!     1.000 000 0
1/2!     0.500 000 0
1/3!     0.166 666 7
1/4!     0.041 666 7
1/5!     0.008 333 3
1/6!     0.001 388 9
1/7!     0.000 198 4
1/8!     0.000 024 8
1/9!     0.000 002 5
1/10!    0.000 000 2
Total    2.718 281 5


We see that this series is rapidly convergent and that $e \approx 2.718\,281\,5$.


The careful reader will note that this value of $e$ is subject to two sources
of error:
(i) because individual terms have been calculated only to seven places of
decimals.

When, for instance, we state that $\dfrac{1}{3!}$ = 0.166 666 7,

we are asserting that it lies between 0.166 666 65 and 0.166 666 75 and
so is subject to a maximum error of 5 in its eighth decimal place, i.e.,
$5 \times 10^{-8}$. As there are eight such terms which have been corrected in this
way, the maximum error from this source is $40 \times 10^{-8} = 4 \times 10^{-7}$.
This is known as the rounding error.
(ii) because the series was cut short at $1/10!$.

The remainder after this term is clearly

$$\frac{1}{11!} + \frac{1}{12!} + \frac{1}{13!} + \cdots = \frac{1}{11!}\left[1 + \frac{1}{12} + \frac{1}{12 \times 13} + \cdots\right]$$

$$< \frac{1}{11!}\left[1 + \frac{1}{12} + \frac{1}{12^2} + \cdots\right] = \frac{1}{11!} \times \frac{1}{1 - \frac{1}{12}} = \frac{12}{11 \times 11!} \approx 2.7 \times 10^{-8}.$$

So the maximum error from this source is about $2.7 \times 10^{-8}$. This is
known as the truncation error.

We therefore see that the total error is less than the sum of these, which is
about $4.27 \times 10^{-7}$, and so is certainly less than $5 \times 10^{-7}$.


Hence $e$ lies in the range $2.718\,281\,5 \pm 5 \times 10^{-7}$
$\Rightarrow 2.718\,281 < e < 2.718\,282$
$\Rightarrow e = 2.718\,28$, correct to five places of decimals


(in fact $e = 2.718\,281\,8$ to seven places).
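The truncation error can be isolated by summing the series in exact fractions, so that no rounding error arises at all. The sketch below is an illustration under that assumption; `e_partial` and `bound` are names introduced here, and `bound` is the geometric-series estimate $\dfrac{1}{11!} \times \dfrac{12}{11}$ of the remainder after the term $1/10!$.

```python
from fractions import Fraction


def e_partial(n_max):
    """Exact value of 1/0! + 1/1! + ... + 1/n_max!."""
    term = Fraction(1)
    total = Fraction(1)
    for n in range(1, n_max + 1):
        term /= n          # each term is the previous one divided by n
        total += term
    return total


s10 = e_partial(10)

# 11! built up by repeated multiplication
fact11 = 1
for n in range(2, 12):
    fact11 *= n

bound = Fraction(12, 11 * fact11)   # (1/11!) * 12/11
print(float(s10), float(bound))
```

Since every omitted term is positive, the exact partial sum lies below $e$, and by less than `bound`.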


Exercise 9.4a 


1 Evaluate the following functions at intervals of 0.5 from $x = -1$ to
$x = +2$:

(i) $1$; (ii) $1 + x$; (iii) $1 + x + \tfrac12x^2$;

(iv) $1 + x + \tfrac12x^2 + \tfrac16x^3$; (v) $1 + x + \tfrac12x^2 + \tfrac16x^3 + \tfrac1{24}x^4$;

(vi) $e^x$.


Hence, on a single accurate figure, plot the graphs of these functions from
$x = -1$ to $x = +2$.


2 Find the power series, as far as the x* term, of: 
es Cat ae (11) S(e* +e *); (iv) 5(e* — 5). 
3 Use the exponential series to calculate to five decimal places: 


Ι 
(i) Je; (ἢ) SS iat 


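Logarithmic series
If $\ln x = a_0 + a_1x + a_2x^2 + a_3x^3 + \cdots$
it would immediately follow that

$$a_0 = \ln 0.$$


But $\ln 0$ does not exist, so we cannot express $\ln x$ as a power series.
We can, however, consider the function $\ln(1 + x)$.

Now $f(x) = \ln(1 + x) \Rightarrow f(0) = \ln 1 = 0$

$\Rightarrow f'(x) = \dfrac{1}{1 + x} \Rightarrow f'(0) = 1$

$\Rightarrow f''(x) = \dfrac{-1}{(1 + x)^2} \Rightarrow f''(0) = -1$

$\Rightarrow f'''(x) = \dfrac{2}{(1 + x)^3} \Rightarrow f'''(0) = +2$

$\Rightarrow f^{(n)}(x) = (-1)^{n-1}\dfrac{(n-1)!}{(1 + x)^n} \Rightarrow f^{(n)}(0) = (-1)^{n-1}(n-1)!$


So, with the usual assumptions for Maclaurin's series,

$$\ln(1 + x) = x - \frac{1}{2!}x^2 + \frac{2!}{3!}x^3 - \frac{3!}{4!}x^4 + \cdots + (-1)^{n-1}\frac{(n-1)!}{n!}x^n + \cdots$$

$$\Rightarrow \ln(1 + x) = x - \tfrac12x^2 + \tfrac13x^3 - \tfrac14x^4 + \cdots \qquad (1)$$

and it can be shown that in this case the domain of convergence is
$-1 < x \le 1$.
Putting $-x$ for $x$, we obtain

$$\ln(1 - x) = -x - \tfrac12x^2 - \tfrac13x^3 - \tfrac14x^4 - \cdots \qquad (2)$$
whose domain of convergence is $-1 < -x \le 1$, i.e., $-1 \le x < 1$.
If we now subtract these two results, we obtain
$$\ln(1 + x) - \ln(1 - x) = 2\left(x + \tfrac13x^3 + \tfrac15x^5 + \cdots\right)$$

$$\Rightarrow \tfrac12\ln\frac{1 + x}{1 - x} = x + \tfrac13x^3 + \tfrac15x^5 + \cdots \qquad (3)$$

which is convergent provided both the original series are convergent, i.e., if
$-1 < x < 1$.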


It is instructive to use these series to calculate $\ln 2$.
If we put $x = 1$ in (1), we obtain


$$\ln 2 = 1 - \tfrac12 + \tfrac13 - \tfrac14 + \cdots$$
But the millionth term is $-1/10^6$ and so is still affecting the sixth decimal
place. The series, therefore, converges so slowly that it is not very useful
for the calculation of $\ln 2$.

If, on the other hand, we use (3), we see that to calculate $\ln 2$ it will be
necessary to let
$$\frac{1 + x}{1 - x} = 2 \Rightarrow 1 + x = 2 - 2x \Rightarrow x = \tfrac13.$$

Putting $x = \tfrac13$, we obtain

$$\tfrac12\ln 2 = \tfrac13 + \tfrac13\left(\tfrac13\right)^3 + \tfrac15\left(\tfrac13\right)^5 + \cdots$$
$$\Rightarrow \ln 2 = 2\left[\tfrac13 + \tfrac13\left(\tfrac13\right)^3 + \tfrac15\left(\tfrac13\right)^5 + \cdots\right]$$
and it is clear that this series converges much more rapidly. How many
terms of this series need to be taken in order to determine the sixth decimal
place?
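The contrast between the two series can be explored with a short script. This is only a sketch for experiment: the function names are illustrative, and the numbers of terms tried are arbitrary.

```python
import math


def ln2_alternating(n_terms):
    """Series (1) with x = 1: 1 - 1/2 + 1/3 - ..."""
    return sum((-1) ** (k + 1) / k for k in range(1, n_terms + 1))


def ln2_fast(n_terms):
    """Series (3) with x = 1/3: ln 2 = 2(x + x^3/3 + x^5/5 + ...)."""
    x = 1.0 / 3.0
    return 2 * sum(x ** (2 * k + 1) / (2 * k + 1) for k in range(n_terms))


exact = math.log(2)
for n in (2, 4, 6, 8):
    print(n, abs(ln2_fast(n) - exact))
print(1000, abs(ln2_alternating(1000) - exact))
```

Running it shows how quickly the error of the second series falls with each extra term, whereas a thousand terms of the first still leave an error in the fourth decimal place.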


Exercise 9.4b 

1 Evaluate the following functions at intervals of 0.5 from $x = -1$ to
$x = +1$:

(i) $x$; (ii) $x - \tfrac12x^2$; (iii) $x - \tfrac12x^2 + \tfrac13x^3$;

(iv) $x - \tfrac12x^2 + \tfrac13x^3 - \tfrac14x^4$; (v) $\ln(1 + x)$.

Hence graph these functions on a single diagram from $x = -1$ to $+1$.


2 Find the power series, as far as the x* term of: 


(i) in( +3); (i) In(1 + 2x); (iti) m-— 


—x 
. ἕν tea oe ; 
4 Use the series for σ In ae to evaluate, to three decimal places: 
. (t) In3; (1) In2- 
Hence calculate In 5. 


Binomial series 


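We now turn to the function
\[
f(x) = (1+x)^n \quad\Rightarrow\quad f(0) = 1
\]
\[
\begin{aligned}
f'(x) &= n(1+x)^{n-1} &&\Rightarrow\; f'(0) = n \\
f''(x) &= n(n-1)(1+x)^{n-2} &&\Rightarrow\; f''(0) = n(n-1).
\end{aligned}
\]
More generally,
\[
f^{(r)}(x) = n(n-1)\cdots(n-r+1)(1+x)^{n-r}
\quad\Rightarrow\quad f^{(r)}(0) = n(n-1)\cdots(n-r+1).
\]
So, using the Maclaurin series, we obtain the binomial series:
\[
(1+x)^n = 1 + nx + \frac{n(n-1)}{2!}x^2 + \cdots + \frac{n(n-1)\cdots(n-r+1)}{r!}x^r + \cdots
\]
and it can be shown that the series is convergent, provided $|x| < 1$.
If $n \in \mathbb{Z}^+$, we notice that the series terminates and reduces to the binomial
theorem.
For instance, if $n = 4$,
\[
\begin{aligned}
(1+x)^4 &= 1 + 4x + \frac{4\times3}{2!}x^2 + \frac{4\times3\times2}{3!}x^3 + \frac{4\times3\times2\times1}{4!}x^4 \\
&\quad + \frac{4\times3\times2\times1\times0}{5!}x^5 + \frac{4\times3\times2\times1\times0\times(-1)}{6!}x^6 + \cdots \\
&= 1 + 4x + 6x^2 + 4x^3 + x^4.
\end{aligned}
\]
More generally, if $n \in \mathbb{Z}^+$,
\[
\frac{n(n-1)(n-2)\cdots(n-r+1)}{r!} =
\begin{cases} {}^nC_r, & \text{if } r \le n, \\ 0, & \text{if } r > n, \end{cases}
\]
so that the binomial series simply becomes the binomial theorem.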

If $n \notin \mathbb{Z}^+$ it is impossible to speak of ${}^nC_r$, as ${}^{-2}C_3$, ${}^{1/2}C_4$, etc. have no
meaning. Nevertheless, we find it convenient to have an abbreviation for
the coefficients of the binomial series, and write
\[
\frac{n(n-1)(n-2)\cdots(n-r+1)}{r!} \quad\text{as}\quad \binom{n}{r}.
\]


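Example 1

Find a series for $\left(1 - \dfrac{x}{2}\right)^{-2}$.
\[
\begin{aligned}
\left(1 - \frac{x}{2}\right)^{-2}
&= 1 + (-2)\left(-\frac{x}{2}\right) + \frac{(-2)(-3)}{2!}\left(-\frac{x}{2}\right)^2
 + \frac{(-2)(-3)(-4)}{3!}\left(-\frac{x}{2}\right)^3 + \cdots \\
&= 1 + x + \tfrac34 x^2 + \tfrac12 x^3 + \cdots
\end{aligned}
\]
which is convergent if $\left|\dfrac{x}{2}\right| < 1 \;\Rightarrow\; |x| < 2$.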


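Example 2
Use the binomial series to estimate $\sqrt[3]{999}$ to seven places of decimals.
\[
\sqrt[3]{999} = (1000 - 1)^{1/3} = 10(1 - 0.001)^{1/3}.
\]
Now
\[
\begin{aligned}
(1 - 0.001)^{1/3} &= 1 + \tfrac13(-0.001) + \frac{\tfrac13\left(-\tfrac23\right)}{2!}(-0.001)^2
 + \frac{\tfrac13\left(-\tfrac23\right)\left(-\tfrac53\right)}{3!}(-0.001)^3 \\
&\quad + \frac{\tfrac13\left(-\tfrac23\right)\left(-\tfrac53\right)\left(-\tfrac83\right)}{4!}(-0.001)^4 + \cdots
\end{aligned}
\]
The first three terms are

    + 1.000 000 000 0
    - 0.000 333 333 3
    - 0.000 000 111 1
    -----------------
    = 0.999 666 555 6

It is easy to see that the maximum rounding error is $10^{-10}$. Also the
remainder is
\[
-\tfrac{5}{81}(0.001)^3 - \tfrac{10}{243}(0.001)^4 - \cdots
\]
so the truncation error is
\[
\begin{aligned}
\tfrac{5}{81}(0.001)^3 + \tfrac{10}{243}(0.001)^4 + \cdots
&< \tfrac{5}{81}(0.001)^3\left[1 + (0.001) + (0.001)^2 + \cdots\right] \\
&= \tfrac{5}{81} \times 10^{-9} \times \tfrac{1000}{999} < 10^{-10}.
\end{aligned}
\]
Hence the total error $< 2 \times 10^{-10}$

and $\sqrt[3]{999} = 9.996\,665\,556$

with a maximum possible error of $2 \times 10^{-9}$.
So $\sqrt[3]{999}$ lies between 9.996 665 554
and 9.996 665 558.

Hence $\sqrt[3]{999} = 9.996\,665\,6$ to seven decimal places.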
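The arithmetic of Example 2 can be reproduced with a short routine. This is a sketch under stated assumptions: `binomial_series` is a name introduced here for illustration, and it simply accumulates successive terms of the binomial series for a general index n, without any check that |x| < 1.

```python
def binomial_series(n, x, terms):
    """Sum the first `terms` terms of 1 + n x + n(n-1)/2! x^2 + ..."""
    total = 0.0
    term = 1.0  # the r = 0 term
    for r in range(terms):
        total += term
        # The next term gains a factor (n - r) / (r + 1) and a factor x.
        term *= (n - r) / (r + 1) * x
    return total

# Cube root of 999 from the first three terms, as in Example 2.
estimate = 10 * binomial_series(1 / 3, -0.001, 3)
reference = 999 ** (1 / 3)
print(f"three-term estimate: {estimate:.10f}")
print(f"difference from 999 ** (1/3): {abs(estimate - reference):.2e}")
```

When n is a positive integer the factor (n - r) eventually becomes zero, so the same loop reproduces the terminating binomial theorem.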
Exercise 9.4c


1 Find the first four terms of the power series of the following functions, 
in each case stating their domain of convergence: 

l 
1 ΠΡ 2 : 
@ Jat); @) 
(iv) (1 — 2.1) 1; ἡ (2. - Γ᾽; (vt) (4 — 2x). 


Ω 


2 Evaluate, to five places of decimals: 
(i) “1.01; (a) 9/0.99; (aid) «(4.1; (ὦ) 2/27.27; (v) (on? 


3 Obtain binomial series for $1/(1 - x)^2$ and $1/(1 - x)^3$, and check by
means of differentiating the series for $1/(1 - x)$.
Hence sum the series: 


Lc 20) ae 89) (a) Ἐπ:: 
4 (i) Using $\sqrt{x^2 + 1} = x(1 + 1/x^2)^{1/2}$, show that if $x$ is large:
\[
\sqrt{x^2 + 1} \approx x + \frac{1}{2x} - \frac{1}{8x^3} + \cdots \quad\text{(known as an asymptotic series).}
\]


(72) Find the first three terms of similar asymptotic series for 1/(x — 1)? 
and ey (0? 2). 


(1) (1+ x)7?; 


Taylor series 


The Maclaurin series enables us to find the value of a function near $x = 0$
in terms of its value and those of its derivatives at $x = 0$. If we now seek
to investigate its value in the neighbourhood of another point $x = a$, we
can let the point be $x = a + h$ and write
\[
\begin{aligned}
f(a+h) &\equiv F(h) &&\Rightarrow\; f(a) = F(0) \\
\text{so that}\quad f'(a+h) &= F'(h) &&\Rightarrow\; f'(a) = F'(0) \\
f''(a+h) &= F''(h) &&\Rightarrow\; f''(a) = F''(0).
\end{aligned}
\]
But
\[
F(h) = F(0) + hF'(0) + \frac{h^2}{2!}F''(0) + \cdots
\]
\[
\Rightarrow\quad f(a+h) = f(a) + hf'(a) + \frac{h^2}{2!}f''(a) + \cdots
\]
which is known as the Taylor series (after Brook Taylor, 1685-1731), and
which enables us to find the value of the function f(x) near x = a in terms
of its value and those of its derivatives at x = a.

In particular, if $h$ is very small,
\[
f(a+h) \approx f(a) + hf'(a)
\]
provides a linear approximation to the function, already familiar in the
form
\[
\frac{f(a+h) - f(a)}{h} \approx f'(a).
\]
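A small numerical experiment shows how the Taylor series behaves near x = a. The sketch below is only an illustration: it takes f(x) = sin x, whose derivatives cycle through sin, cos, -sin, -cos, and the values a = π/6 and h = 0.1 are arbitrary choices made here, not taken from the text.

```python
import math

def taylor_sin(a, h, order):
    """Taylor polynomial of sin about x = a, evaluated at a + h."""
    # Derivatives of sin at a repeat with period four.
    derivatives = [math.sin(a), math.cos(a), -math.sin(a), -math.cos(a)]
    return sum(derivatives[r % 4] * h ** r / math.factorial(r)
               for r in range(order + 1))

a, h = math.pi / 6, 0.1
exact = math.sin(a + h)
linear_error = abs(taylor_sin(a, h, 1) - exact)
cubic_error = abs(taylor_sin(a, h, 3) - exact)

print(f"linear approximation error: {linear_error:.2e}")
print(f"cubic approximation error:  {cubic_error:.2e}")
```

The linear error is of the order of h²/2 and the cubic error of the order of h⁴/24, in line with the first neglected term in each case.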


Exercise 9.4d 


1 Find, as far as the $x^4$ term, Maclaurin series for
(i) sin x; (ii) cos x; (iii) tan x; (iv) sec x; (v) $e^x \sin x$; (vi) ln sec x.

2 Use Maclaurin series to calculate, to five decimal places, tan 0.1, tan 1°,
tan 1′.

3 Find the first four terms of the Taylor series for $\tan(\pi/4 + h)$ and
hence calculate, to five decimal places, tan 46° and tan 45° 1′.

4 Obtain the Maclaurin series for $\tan^{-1} x$ and check by integrating the
binomial series for $1/(1 + x^2)$. Hence
(i) obtain Gregory's series:
\[
\frac{\pi}{4} = 1 - \frac13 + \frac15 - \frac17 + \cdots
\]
(ii) use the result (see Exercise 5.13a, no. 3) that
\[
\frac{\pi}{4} = \tan^{-1}\tfrac12 + \tan^{-1}\tfrac13
\]


to show that 


9.5 Integration by parts: reduction formulae 


We now return to the problem of integration. As was observed in Chapter 4,
a result about differentiation can often be transformed into an equally
useful one about integration, and we saw how the chain rule for derivatives
led to the method of substitution for finding integrals. We now proceed to
look again, in just this way, at the formula for the derivative of a product.
If $u$ and $v$ are two functions of $x$, we know that
\[
\frac{d}{dx}(uv) = u\frac{dv}{dx} + v\frac{du}{dx}.
\]
So
\[
uv = \int u\frac{dv}{dx}\,dx + \int v\frac{du}{dx}\,dx
\quad\Rightarrow\quad
\int u\frac{dv}{dx}\,dx = uv - \int v\frac{du}{dx}\,dx.
\]
This is the rule for integrating by parts and is useful whenever the product
$v\,\dfrac{du}{dx}$ is easier to integrate than $u\,\dfrac{dv}{dx}$.

If we now replace $u$ and $v$ by $U$ and $V$, where
\[
U = u \quad\text{and}\quad V = \frac{dv}{dx},
\]
the result can be written as†
\[
\int UV\,dx = U\int V\,dx - \int\left(\frac{dU}{dx}\int V\,dx\right)dx.
\]


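Example 1
\[
\int x\cos x\,dx = x\sin x - \int \sin x\,dx = x\sin x + \cos x + c.
\]

Example 2
\[
\begin{aligned}
\int_0^1 x^2 e^x\,dx &= \left[x^2 e^x\right]_0^1 - \int_0^1 2x\,e^x\,dx \\
&= e - 2\left\{\left[x e^x\right]_0^1 - \int_0^1 e^x\,dx\right\} \\
&= e - 2\{e - (e - 1)\} \\
&= e - 2.
\end{aligned}
\]

Example 3
\[
\begin{aligned}
\int \ln x\,dx &= \int 1 \times \ln x\,dx \\
&= x\ln x - \int x \times \frac1x\,dx \\
&= x\ln x - x + c.
\end{aligned}
\]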


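Example 4
\[
\begin{aligned}
\int e^{-x}\sin x\,dx &= (-e^{-x})\sin x - \int(-e^{-x})\cos x\,dx \\
&= -e^{-x}\sin x + \int e^{-x}\cos x\,dx \\
&= -e^{-x}\sin x + (-e^{-x})\cos x - \int(-e^{-x})(-\sin x)\,dx \\
&= -e^{-x}(\sin x + \cos x) - \int e^{-x}\sin x\,dx + c
\end{aligned}
\]
\[
\Rightarrow\quad 2\int e^{-x}\sin x\,dx = -e^{-x}(\sin x + \cos x) + c
\]
\[
\Rightarrow\quad \int e^{-x}\sin x\,dx = -\tfrac12 e^{-x}(\sin x + \cos x) + c'.
\]

† Conveniently remembered as $\int UV = U\int V - \int\left(U'\int V\right)$.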
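Results obtained by parts are easily checked numerically. The following sketch is an illustration rather than part of the method: it uses Simpson's rule, which is an assumption introduced here, with an arbitrarily chosen 1000 strips, to confirm the value e - 2 found in Example 2.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with n (even) strips."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        # Odd interior points carry weight 4, even ones weight 2.
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

approx = simpson(lambda x: x * x * math.exp(x), 0.0, 1.0)
exact = math.e - 2  # the value found by parts in Example 2
print(f"Simpson: {approx:.10f}   by parts: {exact:.10f}")
```

The same routine can be pointed at any of the definite integrals in the exercise that follows.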


Exercise 9.58 


Find the following integrals: 


1 (7) [reas ὦ i) | xsin dss (1) [τὰ 
͵ ᾿ In x 
(1) [καρ ὦ )_ [το dx; (vt) | Bas 


x e-* dx; (it) |» sin x dx; 
Osec* 0d0; (iv) | 410" du. 


3 (1) [ποκα ti) | & cose dr 


eé ~cosxdx; (1) | e* sin 2x dx. 


x tan! x ἄχ. 


(In x)*; (22 ) [ tan“! xd 


5 Evaluate 
1 An 

(2) | xe ~dx; (11) | x sin 2x dx; 
0 0 


π 4 
(222 ) \ t* cos st dt; (iv) | u* In u du. 


1 




6 A mathematical model is constructed to describe the situation in a
factory. The number of workers w which an employer will require is a 
function of the wage rate £7 per week for each worker where w = 100e 9-94". 
If the union wishes to maximize the total pay of its members required by the 
employer, what rate will it ask for? (Assume that all workers belong to the 


union. ) 


(S.M.P.) 
Reduction formulae


It is sometimes possible to find a formula which will reduce a given integral 
to a simpler one of the same type, and then to find the required integral by 
successive applications of this process. 


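Example 5

If $I_n = \int x^n e^{-x}\,dx$, evaluate $I_3$.
\[
\begin{aligned}
\text{Now}\quad I_n &= x^n(-e^{-x}) - \int (n x^{n-1})(-e^{-x})\,dx \\
&= -x^n e^{-x} + n\int x^{n-1} e^{-x}\,dx \\
\Rightarrow\quad I_n &= -x^n e^{-x} + nI_{n-1}.
\end{aligned}
\]
\[
\begin{aligned}
\text{Hence}\quad I_3 &= -x^3 e^{-x} + 3I_2 \\
I_2 &= -x^2 e^{-x} + 2I_1 \\
I_1 &= -x e^{-x} + I_0.
\end{aligned}
\]
But $I_0 = -e^{-x} + c$.
\[
\begin{aligned}
\text{So}\quad I_3 &= -x^3 e^{-x} + 3(-x^2 e^{-x} + 2I_1) \\
&= -(x^3 + 3x^2)e^{-x} + 6(-x e^{-x} + I_0) \\
&= -(x^3 + 3x^2 + 6x)e^{-x} + 6I_0 \\
&= -(x^3 + 3x^2 + 6x + 6)e^{-x} + c.
\end{aligned}
\]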


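Example 6
\[
I_n = \int_0^{\pi/2} \sin^n x\,dx.
\]
\[
\begin{aligned}
I_n &= \int_0^{\pi/2}\sin^n x\,dx = \int_0^{\pi/2}\sin^{n-1}x \times \sin x\,dx \\
&= \left[\sin^{n-1}x\,(-\cos x)\right]_0^{\pi/2} - \int_0^{\pi/2}(n-1)\sin^{n-2}x\,\cos x\,(-\cos x)\,dx \\
&= 0 + (n-1)\int_0^{\pi/2}\sin^{n-2}x\,\cos^2 x\,dx \\
&= (n-1)\int_0^{\pi/2}\sin^{n-2}x\,(1 - \sin^2 x)\,dx \\
&= (n-1)\left\{\int_0^{\pi/2}\sin^{n-2}x\,dx - \int_0^{\pi/2}\sin^n x\,dx\right\} \\
&= (n-1)(I_{n-2} - I_n)
\end{aligned}
\]
\[
\Rightarrow\quad nI_n = (n-1)I_{n-2} \quad\Rightarrow\quad I_n = \frac{n-1}{n}\,I_{n-2},
\]
and this reduction formula enables us to calculate $I_n$ in terms of $I_{n-2}$.

But we know that
\[
I_1 = \int_0^{\pi/2}\sin x\,dx = \left[-\cos x\right]_0^{\pi/2} = 1
\quad\text{and}\quad
I_0 = \int_0^{\pi/2} 1\,dx = \left[x\right]_0^{\pi/2} = \tfrac12\pi.
\]
So it follows, for instance, that
\[
I_4 = \tfrac34 I_2 = \tfrac34 \times \tfrac12 I_0 = \frac{3\pi}{16}
\quad\text{and}\quad
I_5 = \tfrac45 I_3 = \tfrac45 \times \tfrac23 I_1 = \frac{8}{15}.
\]
Similarly, if $J_n = \int_0^{\pi/2}\cos^n x\,dx$, we can put $x = \tfrac12\pi - u$, so that
\[
J_n = \int_{\pi/2}^{0}\cos^n\left(\tfrac12\pi - u\right) \times (-du) = \int_0^{\pi/2}\sin^n u\,du = I_n.
\]
Summarising, if $I_n = \int_0^{\pi/2}\sin^n x\,dx$, $J_n = \int_0^{\pi/2}\cos^n x\,dx$, then
\[
I_n = J_n =
\begin{cases}
\dfrac{n-1}{n} \times \dfrac{n-3}{n-2} \times \cdots \times \dfrac45 \times \dfrac23 \times 1 & (n \text{ odd}), \\[2ex]
\dfrac{n-1}{n} \times \dfrac{n-3}{n-2} \times \cdots \times \dfrac34 \times \dfrac12 \times \dfrac{\pi}{2} & (n \text{ even}).
\end{cases}
\]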
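The reduction formula of Example 6 translates directly into a loop. The sketch below is illustrative: `wallis` and `midpoint_integral` are names chosen here, and the midpoint rule with 20000 strips is simply one convenient way of checking the values independently.

```python
import math

def wallis(n):
    """I_n for sin^n x on [0, pi/2], built up from I_0 or I_1."""
    value = math.pi / 2 if n % 2 == 0 else 1.0  # I_0 or I_1
    for k in range(2 + n % 2, n + 1, 2):
        value *= (k - 1) / k  # I_k = (k - 1)/k * I_(k-2)
    return value

def midpoint_integral(n, steps=20000):
    """Midpoint-rule estimate of the same integral, as a check."""
    h = (math.pi / 2) / steps
    return h * sum(math.sin((i + 0.5) * h) ** n for i in range(steps))

for n in range(6):
    print(n, f"{wallis(n):.8f}", f"{midpoint_integral(n):.8f}")
```

Because J_n = I_n, the same values serve for the corresponding integrals of cosⁿ x.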


Exercise 9.5b 


1 Evaluate: 


(i) [ sint θ40θ: (ἢ) Ι 
0 0 


(iv) [ sin? 0d0; () |, 
0 0 


ἀπ 
cos* 6 dé; (122 ) | cos® θ dé; 
0 


cos® 6 dé. 
2 Evaluate:


| sin? 6 40: (ii) [ εος δά: “(δῇ | cos? 6 dO: 


0 


Gi) ᾿ sin? θ4θ: () | agin 8 ἀδ. 


— ae | 
an 3 


3 If $I_n = \int (\ln x)^n\,dx$, show that $I_n = x(\ln x)^n - nI_{n-1}$.


Hence find Js. 


4 Use tan” @ = tan"~? @ (sec? θ — 1) to find a reduction formula for 


= | tan” θ d@, and hence find /, and J,. 


on 
μι 
"5 
Pom 
I 


᾿ | sec” θ d@, prove that 


oon, 
3 
+ 
— 
~~ 
Ἢ 
+ 
N 
| 


= tan # sec” 6 + ni,. 


9.6 Interlude: partial fractions 


At an early age we learnt such simplifications as
\[
\frac23 - \frac15 = \frac{10 - 3}{15} = \frac{7}{15},
\]
and, rather later, when we learned that
\[
\frac{1}{x-1} - \frac{1}{x+1} = \frac{(x+1) - (x-1)}{(x-1)(x+1)} = \frac{2}{x^2 - 1},
\]
we had little doubt that this too represented a simplification, as $2/(x^2 - 1)$
is generally more useful and more convenient than $1/(x - 1) - 1/(x + 1)$.
In the next section we shall frequently be needing to find such integrals as
\[
\int \frac{1}{x^2 - 1}\,dx,
\]
for which there is no immediately obvious answer. But a little meditation
might lead us to reverse the above process and say that
\[
\int \frac{1}{x^2 - 1}\,dx = \frac12\int\left(\frac{1}{x-1} - \frac{1}{x+1}\right)dx = \frac12\ln\frac{x-1}{x+1} + c.
\]
Obviously we were extremely fortunate to know that
\[
\frac{1}{x^2 - 1} = \frac12\left\{\frac{1}{x-1} - \frac{1}{x+1}\right\}
\]
and our next task will be to find a process whereby similar rational functions
(i.e. the quotients of two polynomials) can be expressed in terms of
such partial fractions.

The full theory of partial fractions is beyond our present scope, but we 
can illustrate the procedure by a number of examples: 


Example 1
Express in partial fractions
\[
\frac{x+1}{(x-1)(x-2)}.
\]
Hopefully, we write
\[
\frac{x+1}{(x-1)(x-2)} \equiv \frac{A}{x-1} + \frac{B}{x-2}
\]
where A, B are constants which are to be determined
\[
\Rightarrow\quad x + 1 \equiv A(x - 2) + B(x - 1).
\]
Putting x = 2, $3 = A \times 0 + B \times 1 \;\Rightarrow\; B = 3$.

Putting x = 1, $2 = A \times (-1) + B \times 0 \;\Rightarrow\; A = -2$.

So the required partial fractions would be
\[
\frac{-2}{x-1} + \frac{3}{x-2}
\]
and it is quickly verified that these can be combined to give the original
expression.


Example 2
We now seek to put into partial fractions the more complicated expression
\[
\frac{2x^3 - 3x^2 - x + 5}{2x^2 + x - 1}.
\]
First of all we observe that $2x^3 - 3x^2 - x + 5$ can be divided by
$2x^2 + x - 1$ to give a quotient $x - 2$ and remainder $2x + 3$.
So
\[
2x^3 - 3x^2 - x + 5 = (2x^2 + x - 1)(x - 2) + (2x + 3)
\]
\[
\Rightarrow\quad \frac{2x^3 - 3x^2 - x + 5}{2x^2 + x - 1} = x - 2 + \frac{2x + 3}{2x^2 + x - 1}.
\]
Now
\[
\frac{2x + 3}{2x^2 + x - 1} = \frac{2x + 3}{(2x - 1)(x + 1)},
\]
so we write
\[
\frac{2x + 3}{(2x - 1)(x + 1)} \equiv \frac{A}{2x - 1} + \frac{B}{x + 1}
\quad\Rightarrow\quad 2x + 3 \equiv A(x + 1) + B(2x - 1).
\]
Putting x = -1, $1 = A \times 0 - 3B \;\Rightarrow\; B = -\tfrac13$.
Putting x = 1/2, $4 = \tfrac32 A \;\Rightarrow\; A = \tfrac83$.
So (as can easily be checked),
\[
\frac{2x + 3}{(2x - 1)(x + 1)} = \frac{8}{3(2x - 1)} - \frac{1}{3(x + 1)}
\]
and
\[
\frac{2x^3 - 3x^2 - x + 5}{2x^2 + x - 1} = x - 2 + \frac{8}{3(2x - 1)} - \frac{1}{3(x + 1)}.
\]
In this case the key to our success lay in first dividing the numerator by
the denominator, and wherever possible (i.e., unless the degree of the
numerator is already less than that of the denominator) this must always
be the initial step.


Example 3

We now take a function whose denominator contains a repeated factor:
\[
f(x) = \frac{x^2 + 5x + 9}{(x + 3)(x + 2)^2}.
\]
Again, we first try the simple partial fractions:
\[
\frac{x^2 + 5x + 9}{(x + 3)(x + 2)^2} \equiv \frac{A}{x + 3} + \frac{B}{(x + 2)^2} \tag{1}
\]
\[
\Rightarrow\quad x^2 + 5x + 9 \equiv A(x + 2)^2 + B(x + 3).
\]
Equating coefficients of $x^2$, $x$, and the constant terms, we see that it
would be necessary to have
\[
1 = A, \qquad 5 = 4A + B, \qquad 9 = 4A + 3B,
\]
which are clearly inconsistent.

So f(x) cannot be expressed in terms of partial fractions (1).

For our next attempt, therefore, above the quadratic denominator
$(x + 2)^2$ we place a term $Bx + C$, so that
\[
\frac{x^2 + 5x + 9}{(x + 3)(x + 2)^2} \equiv \frac{A}{x + 3} + \frac{Bx + C}{(x + 2)^2}
\]
\[
\Rightarrow\quad x^2 + 5x + 9 \equiv A(x + 2)^2 + (Bx + C)(x + 3).
\]
Putting x = -3: $3 = A \;\Rightarrow\; A = 3$.
Equating coefficient $x^2$: $1 = A + B \;\Rightarrow\; B = -2$.
Equating constants: $9 = 4A + 3C \;\Rightarrow\; C = -1$.

Hence (as can easily be checked) the required partial fractions are
\[
\frac{x^2 + 5x + 9}{(x + 3)(x + 2)^2} = \frac{3}{x + 3} - \frac{2x + 1}{(x + 2)^2}.
\]
Sometimes it is convenient to split this last fraction still further, by
saying that
\[
\frac{2x + 1}{(x + 2)^2} = \frac{2(x + 2) - 3}{(x + 2)^2} = \frac{2}{x + 2} - \frac{3}{(x + 2)^2}
\]
so that
\[
\frac{x^2 + 5x + 9}{(x + 3)(x + 2)^2} = \frac{3}{x + 3} - \frac{2}{x + 2} + \frac{3}{(x + 2)^2}.
\]


Example 4

Finally, we take a rational function whose denominator contains a quadratic
term which cannot be factorised:
\[
\frac{x - 3}{(x - 1)(x^2 + 1)}.
\]
As it is impossible to express $x^2 + 1$ in real factors, we follow the precedent
of the last example and write
\[
\frac{x - 3}{(x - 1)(x^2 + 1)} \equiv \frac{A}{x - 1} + \frac{Bx + C}{x^2 + 1}
\quad\Rightarrow\quad x - 3 \equiv A(x^2 + 1) + (Bx + C)(x - 1).
\]
Putting x = 1: $-2 = 2A \;\Rightarrow\; A = -1$.
$x^2$ term: $0 = A + B \;\Rightarrow\; B = +1$.
Constants: $-3 = A - C \;\Rightarrow\; C = +2$.

So it appears that
\[
\frac{x - 3}{(x - 1)(x^2 + 1)} = \frac{-1}{x - 1} + \frac{x + 2}{x^2 + 1}
\]
which can quickly be verified.
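The verification mentioned above can be carried out exactly with rational arithmetic. The sketch below is an illustration only: the function names and the choice of sample points are assumptions made here. It evaluates the original fraction of Example 4 and its partial fractions at a spread of rational values of x (avoiding x = 1, where both are undefined).

```python
from fractions import Fraction

def original(x):
    """(x - 3) / ((x - 1)(x^2 + 1)), evaluated exactly."""
    return (x - 3) / ((x - 1) * (x * x + 1))

def decomposed(x):
    """The partial fractions -1/(x - 1) + (x + 2)/(x^2 + 1)."""
    return Fraction(-1) / (x - 1) + (x + 2) / (x * x + 1)

# Rational sample points k/3, skipping the pole at x = 1.
samples = [Fraction(k, 3) for k in range(-9, 10) if Fraction(k, 3) != 1]
all_equal = all(original(x) == decomposed(x) for x in samples)
print("partial fractions agree at every sample point:", all_equal)
```

Agreement at more points than the degree of the numerator polynomial is enough to confirm the identity, since both sides share the same denominator.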
Partial fractions are principally used in integration, but it is also interesting
to see how they can help us to find the sums of certain series.
If, for instance, we consider the series
\[
\frac{1}{1 \times 3} + \frac{1}{3 \times 5} + \frac{1}{5 \times 7} + \cdots
\]
we see that its $r$th term is $\dfrac{1}{(2r - 1)(2r + 1)}$.
This can easily be expressed in partial fractions as
\[
\frac12\left\{\frac{1}{2r - 1} - \frac{1}{2r + 1}\right\}.
\]
Hence the sum of the first $n$ terms is
\[
\begin{aligned}
S_n &= \frac{1}{1 \times 3} + \frac{1}{3 \times 5} + \frac{1}{5 \times 7} + \cdots + \frac{1}{(2n - 1)(2n + 1)} \\
&= \frac12\left\{\left(1 - \frac13\right) + \left(\frac13 - \frac15\right) + \left(\frac15 - \frac17\right) + \cdots + \left(\frac{1}{2n - 1} - \frac{1}{2n + 1}\right)\right\} \\
&= \frac12\left\{1 - \frac{1}{2n + 1}\right\} = \frac{n}{2n + 1}.
\end{aligned}
\]
Also, as $n \to \infty$, $S_n \to \tfrac12$.

So the sum of the infinite series is $\tfrac12$.
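The closed form for S_n can be confirmed by adding the terms exactly. This is a minimal sketch, with `partial_sum` a name introduced here for illustration; it uses exact fractions so that no rounding obscures the comparison with n/(2n + 1).

```python
from fractions import Fraction

def partial_sum(n):
    """Exact sum of 1/((2r - 1)(2r + 1)) for r = 1, ..., n."""
    return sum(Fraction(1, (2 * r - 1) * (2 * r + 1)) for r in range(1, n + 1))

for n in (1, 2, 3, 10, 100):
    print(n, partial_sum(n), Fraction(n, 2 * n + 1))
```

The printed pairs coincide, and the values creep up towards 1/2 as n grows.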


Exercise 9.6a 


1 Express in partial fractions 


ae ᾿ , ee Ι ; 

UY Gane Ξ he ge) eat: 
x? oe am . w+) 

wh Ge T= δε δ» αν. 
ae Oe ae. 

(vit) et (v2) — a 


2 Express in partial fractions 
] | x — |] 


(2) ae (1) aa Ay 


) 
3 Express in partial fractions


(2) 


] - l he ] (iv) x 
| | iy | 
[etn ea “eer e+] 


4 Find the first four terms of the power series for $\dfrac{1}{(1 + x)(1 + 2x)}$
by first expressing it

(i) in partial fractions;

(ii) as $[1 + (3x + 2x^2)]^{-1}$;
(iii) as $(1 + x)^{-1}(1 + 2x)^{-1}$.


5 Find the rth term, the sum of the first n terms and the sum to infinity of 
the series 


Ι 

τ ὠττ τι 

: | Ι ] 

τ στ το 

(iii) $\dfrac{1}{1 \times 2 \times 3} + \dfrac{1}{2 \times 3 \times 4} + \dfrac{1}{3 \times 4 \times 5} + \cdots$


Use of partial fractions in integration 


In the next section we shall return to the general problem of integration, 
but we can already see the use of partial fractions when we need to integrate 
the quotient of two polynomials (or rational function, as it is usually called). 


Example 5 
] 
[= | = dx. 
, alee 


It is quickly seen that 


x+1 2 ᾿ 


= 31In (x -- 2) -- Ζ]ηὴ (x + 2) +. 
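Example 6
\[
I = \int \frac{x^2}{(x - 1)(x + 1)^2}\,dx.
\]
We can find the full partial fractions in such a case, with a repeated factor,
by letting
\[
\frac{x^2}{(x - 1)(x + 1)^2} \equiv \frac{A}{x - 1} + \frac{B}{x + 1} + \frac{C}{(x + 1)^2}
\]
\[
\Rightarrow\quad x^2 \equiv A(x + 1)^2 + B(x - 1)(x + 1) + C(x - 1).
\]
Putting x = 1, $1 = 4A \;\Rightarrow\; A = \tfrac14$.
Putting x = -1, $1 = -2C \;\Rightarrow\; C = -\tfrac12$.
Equating coefficient $x^2$, $1 = A + B \;\Rightarrow\; B = \tfrac34$.
So
\[
\begin{aligned}
I &= \int\left\{\frac{1}{4(x - 1)} + \frac{3}{4(x + 1)} - \frac{1}{2(x + 1)^2}\right\}dx \\
&= \tfrac14\ln(x - 1) + \tfrac34\ln(x + 1) + \frac{1}{2(x + 1)} + c.
\end{aligned}
\]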
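Example 7
\[
I = \int \frac{x^2}{(x - 1)(x^2 + 1)}\,dx.
\]
We can express this in partial fractions as
\[
\frac{x^2}{(x - 1)(x^2 + 1)} \equiv \frac{A}{x - 1} + \frac{Bx + C}{x^2 + 1}.
\]
So $x^2 \equiv A(x^2 + 1) + (Bx + C)(x - 1)$.
Putting x = 1, $1 = 2A \;\Rightarrow\; A = \tfrac12$.
Equating coefficient $x^2$, $1 = A + B \;\Rightarrow\; B = \tfrac12$.
Equating constant term, $0 = A - C \;\Rightarrow\; C = \tfrac12$.
So
\[
\frac{x^2}{(x - 1)(x^2 + 1)} = \frac{\tfrac12}{x - 1} + \frac{\tfrac12 x + \tfrac12}{x^2 + 1}
\]
\[
\begin{aligned}
\Rightarrow\quad I &= \tfrac12\int\frac{dx}{x - 1} + \tfrac14\int\frac{2x\,dx}{x^2 + 1} + \tfrac12\int\frac{dx}{x^2 + 1} \\
&= \tfrac12\ln(x - 1) + \tfrac14\ln(x^2 + 1) + \tfrac12\tan^{-1}x + c.
\end{aligned}
\]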


In this last example we encountered a quadratic term x? + 1 which 
cannot be split into real linear factors (and which is therefore said to be 
irreducible). As such terms, and their corresponding partial fractions, occur 
quite frequently, our next example will show how they can best be treated: 


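Example 8
\[
\text{(i)}\ \int\frac{2x - 6}{x^2 - 6x + 25}\,dx; \qquad
\text{(ii)}\ \int\frac{dx}{x^2 - 6x + 25}; \qquad
\text{(iii)}\ \int\frac{x\,dx}{x^2 - 6x + 25}.
\]
In (i) we immediately see that the numerator is the derivative of the
denominator.
So
\[
\int\frac{2x - 6}{x^2 - 6x + 25}\,dx = \ln(x^2 - 6x + 25) + c.
\]
In (ii) we first write
\[
\int\frac{dx}{x^2 - 6x + 25} = \int\frac{dx}{(x - 3)^2 + 16}.
\]
Putting $x - 3 = 4\tan\theta \;\Rightarrow\; dx = 4\sec^2\theta\,d\theta$, we obtain
\[
\int\frac{dx}{x^2 - 6x + 25} = \int\frac{4\sec^2\theta\,d\theta}{16\tan^2\theta + 16}
= \tfrac14\int d\theta = \tfrac14\theta + c.
\]
But $x - 3 = 4\tan\theta$
\[
\Rightarrow\quad \tan\theta = \frac{x - 3}{4} \quad\Rightarrow\quad \theta = \tan^{-1}\frac{x - 3}{4}.
\]
So
\[
\int\frac{dx}{x^2 - 6x + 25} = \tfrac14\tan^{-1}\frac{x - 3}{4} + c.
\]
In (iii) we express the required integral as a linear combination of the two
previous parts:
\[
\begin{aligned}
\int\frac{x\,dx}{x^2 - 6x + 25} &= \int\frac{\tfrac12(2x - 6)}{x^2 - 6x + 25}\,dx + \int\frac{3}{x^2 - 6x + 25}\,dx \\
&= \tfrac12\ln(x^2 - 6x + 25) + \tfrac34\tan^{-1}\frac{x - 3}{4} + c.
\end{aligned}
\]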


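Example 9
\[
I = \int\frac{x}{x^3 + 1}\,dx.
\]
As $x^3 + 1 = (x + 1)(x^2 - x + 1)$,
we can express $x/(x^3 + 1)$ in partial fractions:
\[
\frac{x}{x^3 + 1} \equiv \frac{A}{x + 1} + \frac{Bx + C}{x^2 - x + 1}.
\]
So $x \equiv A(x^2 - x + 1) + (Bx + C)(x + 1)$.
Putting x = -1, $-1 = A(1 + 1 + 1) \;\Rightarrow\; A = -\tfrac13$.
Equating coefficient $x^2$, $0 = A + B \;\Rightarrow\; B = +\tfrac13$.
Equating constants, $0 = A + C \;\Rightarrow\; C = +\tfrac13$.
So
\[
\frac{x}{x^3 + 1} = \frac{-\tfrac13}{x + 1} + \frac{\tfrac13 x + \tfrac13}{x^2 - x + 1}.
\]
But
\[
\int\frac{dx}{x^2 - x + 1} = \int\frac{dx}{\left(x - \tfrac12\right)^2 + \tfrac34}
\]
and we now let $x - \tfrac12 = \dfrac{\sqrt3}{2}\tan\theta \;\Rightarrow\; dx = \dfrac{\sqrt3}{2}\sec^2\theta\,d\theta$.
Hence
\[
\int\frac{dx}{x^2 - x + 1} = \int\frac{\tfrac12\sqrt3\,\sec^2\theta\,d\theta}{\tfrac34\tan^2\theta + \tfrac34}
= \frac{2}{\sqrt3}\int d\theta = \frac{2}{\sqrt3}\,\theta.
\]
But $x - \tfrac12 = \dfrac{\sqrt3}{2}\tan\theta \;\Rightarrow\; \dfrac{2x - 1}{\sqrt3} = \tan\theta \;\Rightarrow\; \theta = \tan^{-1}\dfrac{2x - 1}{\sqrt3}$.
So
\[
\int\frac{dx}{x^2 - x + 1} = \frac{2}{\sqrt3}\tan^{-1}\frac{2x - 1}{\sqrt3}
\]
and
\[
\begin{aligned}
I &= -\tfrac13\int\frac{dx}{x + 1} + \tfrac16\int\frac{2x - 1}{x^2 - x + 1}\,dx + \tfrac12\int\frac{dx}{x^2 - x + 1} \\
&= -\tfrac13\ln(x + 1) + \tfrac16\ln(x^2 - x + 1) + \frac{1}{\sqrt3}\tan^{-1}\frac{2x - 1}{\sqrt3} + c \\
&= \tfrac16\ln\frac{x^2 - x + 1}{(x + 1)^2} + \frac{1}{\sqrt3}\tan^{-1}\frac{2x - 1}{\sqrt3} + c.
\end{aligned}
\]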
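An answer such as that of Example 9 is most safely checked by differentiating it. The sketch below does this numerically and is illustrative only: the central-difference step h = 1e-5 and the sample points are arbitrary choices made here, and the antiderivative is entered with the constant c omitted.

```python
import math

def antiderivative(x):
    """The result of Example 9 (constant of integration omitted)."""
    return (-math.log(x + 1) / 3
            + math.log(x * x - x + 1) / 6
            + math.atan((2 * x - 1) / math.sqrt(3)) / math.sqrt(3))

def integrand(x):
    return x / (x ** 3 + 1)

def central_difference(f, x, h=1e-5):
    """Numerical derivative of f at x."""
    return (f(x + h) - f(x - h)) / (2 * h)

for x in (0.0, 0.5, 1.0, 2.0, 5.0):
    print(x, f"{central_difference(antiderivative, x):.8f}", f"{integrand(x):.8f}")
```

The two columns agree to the accuracy of the difference quotient, for x > -1 where the logarithm is defined.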
Exercise 9.6b 


Find the following integrals: 


: dx ᾿ dx " dx 
I (1) [- Ὁ (11) |x is τ΄ (222 ) Ις- - fo? 
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! x? ἧς: dx dx 
eg. Oh ae A Vo a ad 


; dx (ii) dx 
a, Cee 1) ae a2) 
x+ 1 x? dx 
πῇ 4 
ait) | iP x: (qv) F Ξ ἢ 
2x dx x + 3 
ay: 
3 0) | +3 Ess +9 [233 
8x dx 2x -- 3 
és aos 
4) ee lars 4x2 + 9 [Ξ-Ξ "a 
as 2x — 6 x dx 
(122 ) > 
Branly ETT EST x“ — 6x + 10 
. x dx = aie aie és ἜΡΓ oa: 
4 (2) [os ΜῊΝ ait) | στ“; 
: x dx x? dx x dx 
2) [- ae τ \2 aa ve [- = 


9.7 General integration 


In the last section we learned to integrate rational functions by means of 
partial fractions. We now seek to draw together other methods we have 
employed for finding integrals. 

Faced with a particular function whose integral is required, our primary 
need is for a very thorough knowledge of differentiation so that we can try 
to find another function, or ‘primitive’, whose derivative is the given 
function. This may not always be possible: for instance, until we invented 
the function In x it was impossible to integrate even such a simple function 
as 1/x, and we shall frequently encounter similar examples. Nevertheless, 
our first step must be to re-state the standard derivatives and the integrals to 
which they give rise. 


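\[
\begin{array}{ll}
\dfrac{d}{dx}(x^n) = nx^{n-1} & \displaystyle\int x^n\,dx = \frac{x^{n+1}}{n+1} + c \quad (n \ne -1) \\[2ex]
\dfrac{d}{dx}(\ln x) = \dfrac1x & \displaystyle\int \frac{dx}{x} = \ln x + c \\[2ex]
\dfrac{d}{dx}(e^x) = e^x & \displaystyle\int e^x\,dx = e^x + c \\[2ex]
\dfrac{d}{dx}(\sin x) = \cos x & \displaystyle\int \cos x\,dx = \sin x + c \\[2ex]
\dfrac{d}{dx}(\cos x) = -\sin x & \displaystyle\int \sin x\,dx = -\cos x + c \\[2ex]
\dfrac{d}{dx}(\tan x) = \sec^2 x & \displaystyle\int \sec^2 x\,dx = \tan x + c \\[2ex]
\dfrac{d}{dx}(\sin^{-1} x) = \dfrac{1}{\sqrt{1 - x^2}} & \displaystyle\int \frac{dx}{\sqrt{1 - x^2}} = \sin^{-1} x + c \\[2ex]
\dfrac{d}{dx}(\tan^{-1} x) = \dfrac{1}{1 + x^2} & \displaystyle\int \frac{dx}{1 + x^2} = \tan^{-1} x + c
\end{array}
\]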

These standard integrals frequently need to be used in conjunction with 
general methods such as that of substitution or integration by parts and, 
although these have been investigated in previous sections, considerable 
practice is necessary in order to sense what is the most appropriate method 
for each particular example. We have already stressed the crucial impor- 
tance of integration in finding areas and volumes, and it will soon be seen 
that it is similarly central in statistics, mechanics, and throughout all 
branches of applied mathematics. 


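Example 1
\[
I = \int\sqrt{a^2 - x^2}\,dx.
\]
Here a substitution converts the integral into another which is more
tractable,

for $x = a\sin\theta \;\Rightarrow\; dx = a\cos\theta\,d\theta$.
\[
\begin{aligned}
\text{So}\quad I &= \int\sqrt{a^2 - a^2\sin^2\theta}\;a\cos\theta\,d\theta \\
&= a^2\int\cos^2\theta\,d\theta \\
&= \tfrac12 a^2\int(1 + \cos 2\theta)\,d\theta \\
&= \tfrac12 a^2\left(\theta + \tfrac12\sin 2\theta\right) + c \\
&= \tfrac12 a^2(\theta + \sin\theta\cos\theta) + c.
\end{aligned}
\]
Finally, however, it is necessary to revert to the original variable x.
\[
\text{So}\quad \int\sqrt{a^2 - x^2}\,dx = \tfrac12 a^2\sin^{-1}\frac{x}{a} + \tfrac12 x\sqrt{a^2 - x^2} + c.
\]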
Example 2
\[
I = \int\tan^{-1}x\,dx.
\]
Because $\tan^{-1}x$ is a function which we can easily differentiate, there is
an immediate attraction in using the method of integration by parts:
\[
\begin{aligned}
I &= \int 1 \times \tan^{-1}x\,dx \\
&= x\tan^{-1}x - \int x \times \frac{1}{1 + x^2}\,dx.
\end{aligned}
\]
\[
\text{So}\quad I = x\tan^{-1}x - \tfrac12\int\frac{2x}{1 + x^2}\,dx
= x\tan^{-1}x - \tfrac12\ln(1 + x^2) + c.
\]
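Example 3
\[
I = \int\operatorname{cosec} x\,dx.
\]
It may be recalled that sin x, cos x, and tan x are easily expressible in
terms of $\tan\tfrac12 x$:
\[
t = \tan\tfrac12 x \quad\Rightarrow\quad
\sin x = \frac{2t}{1 + t^2}, \quad \cos x = \frac{1 - t^2}{1 + t^2}, \quad \tan x = \frac{2t}{1 - t^2}.
\]
So the substitution
\[
t = \tan\tfrac12 x
\]
is sometimes particularly convenient for simplifying trigonometric integrals.

Now $t = \tan\dfrac{x}{2}$
\[
\Rightarrow\quad dt = \tfrac12\sec^2\tfrac12 x\,dx = \tfrac12(1 + t^2)\,dx
\quad\Rightarrow\quad dx = \frac{2\,dt}{1 + t^2}.
\]
\[
\text{So}\quad I = \int\frac{2\,dt/(1 + t^2)}{2t/(1 + t^2)} = \int\frac{dt}{t} = \ln t + c = \ln\tan\tfrac12 x + c.
\]
Hence
\[
\int\operatorname{cosec} x\,dx = \ln\tan\tfrac12 x + c
\]
and it is quickly verified that
\[
\frac{d}{dx}\left(\ln\tan\tfrac12 x\right) = \frac{\tfrac12\sec^2\tfrac12 x}{\tan\tfrac12 x}
= \frac{1}{2\sin\tfrac12 x\cos\tfrac12 x} = \frac{1}{\sin x}.
\]
To complete the catalogue of the integrals of elementary trigonometric
functions, we recall that
\[
\int\tan x\,dx = \int\frac{\sin x}{\cos x}\,dx = -\ln\cos x + c = \ln\sec x + c
\]
and
\[
\begin{aligned}
\int\sec x\,dx &= \int\sec x \times \frac{\sec x + \tan x}{\sec x + \tan x}\,dx \\
&= \int\frac{\sec x\tan x + \sec^2 x}{\sec x + \tan x}\,dx \\
&= \ln(\sec x + \tan x) + c.
\end{aligned}
\]
Finally, we give two examples which are quickly dealt with by a preliminary
rearrangement: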


Example 4
\[
I = \int\tan^4 x\,dx.
\]
\[
\begin{aligned}
\text{Now}\quad I &= \int(\tan^2 x)(\tan^2 x)\,dx \\
&= \int\tan^2 x\,(\sec^2 x - 1)\,dx \\
&= \int(\tan^2 x\sec^2 x - \sec^2 x + 1)\,dx \\
&= \tfrac13\tan^3 x - \tan x + x + c.
\end{aligned}
\]

Example 5
\[
I = \int\frac{\sin x}{\sin x + \cos x}\,dx.
\]
We first notice that we could easily find
\[
\int\frac{\sin x + \cos x}{\sin x + \cos x}\,dx = \int dx = x
\]
and
\[
\int\frac{\cos x - \sin x}{\sin x + \cos x}\,dx = \ln(\sin x + \cos x).
\]
So
\[
\begin{aligned}
\int\frac{\sin x}{\sin x + \cos x}\,dx
&= \tfrac12\int\frac{\sin x + \cos x}{\sin x + \cos x}\,dx - \tfrac12\int\frac{\cos x - \sin x}{\sin x + \cos x}\,dx \\
&= \tfrac12 x - \tfrac12\ln(\sin x + \cos x) + c.
\end{aligned}
\]
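The rearrangement in the last example can be tested on a definite integral. In the sketch below (an illustration, with the interval from 0 to π/2, the midpoint rule and the step count all chosen here for convenience) the antiderivative ½x - ½ ln(sin x + cos x) is compared with a direct numerical estimate.

```python
import math

def antiderivative(x):
    """x/2 - (1/2) ln(sin x + cos x), constant omitted."""
    return 0.5 * x - 0.5 * math.log(math.sin(x) + math.cos(x))

def midpoint_rule(f, a, b, steps=10000):
    """Midpoint-rule estimate of the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (i + 0.5) * h) for i in range(steps))

a, b = 0.0, math.pi / 2
from_formula = antiderivative(b) - antiderivative(a)
numeric = midpoint_rule(lambda x: math.sin(x) / (math.sin(x) + math.cos(x)), a, b)
print(f"from the formula: {from_formula:.8f}   numerically: {numeric:.8f}")
```

On this interval the logarithmic terms vanish at both ends, so the value is simply π/4.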


Exercise 9.7 


Find the following integrals: 


3d 
(iii) | Ἐπ 
(v2 ) | (x + 1)* dx; 
(1x) [τ -- 1)" dx; 
. x dx 
(x22 ) | a = ΠΣ 


; 2 " 9 du 

2 (2) | ax (11) | aye: (111) F ae 
v dv x? dx 2x + 1 
(iv) [-Ξ 3) [. 3) es x; 
(vit ) | 9625 dx: (viii ) | 2 οὐ dx; (ix) | xe7* dx: 
(x) Ι ia ἀμ; (x2) | 10: de; (x32 ) dx. 


cos 2x dx; 


(SX) 
——, 


(12) | sin 3x dx; 


(221 ) | sin? 0 cos 6 dé; 
(v2 ) | sin? 6 dé; 


(ix) | cos* x dx; 


(xii) | ne ae 
cos 8 


(iv) | cos? θ 50.040: (v) | cos? 6 dé; 
(vii) | sinxcosxdx; (viii) | sin? 4x dx; 
© [geen oo [Shon 
4 (i) | sect xr (ii) | tan? vax 
(iii) | sec 6 tan 0 d@; (iv) | cosec θ cot 6 40: 
(v) | cosec? 6 dO; (vi) | cot? θ dé 
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(vit ) | tan 6 sec* 6 dé; 
(2x) | can 2x dx; (x) 


(x2) | cot 5x dx; 


dx 
5 (1) Fic = 2) 


(v227) | sec” 6 tan 0 dé; 


| tan” x dx; 


(x22 ) | cot? x dx. 


(x = 2 sin 6); 


(ii) eee (Qx = 3sin 0); 
(iii) | ; oS (2x = tan 6); 
(iv) | oo (3x = 2 tan 0); 
(2) [aut yw = tan 8); 
wi) [ot we 
(vit ) | vo —x*\dx (x = sin 6); 


(vitt ) | vo — x7) dx; 

6 (2) | soos. (22 ) i * dx; (11) | εἰμ sea 
(ww) | #?sin x (v) [ snr ads (vt ) | nade 
(vit ) | sec θ4θ: (viz) |= 6“ dx; (x) [tsi eae 
(x) | eos 3¢dt; (xt) ἡ Inxdx; (xii) [110 dx. 


7 Use the substitution $t = \tan\tfrac12 x$ to find:


dx ° 
1 + sin x’ 


$n $n 
(2) | (2) | 


eS oe oe 
1 + sin x + cos x 


dx dx 


sx 


ἀπ 
(221 ) | Ss 
ο sin x + cos: 
8 Show that


|. fordr= [fea 


and hence (using no. 7) evaluate: 


ee. Se ον [27 x dx τς [3 x dx 
(ON ery "15 Se ξεεαξ ΝΜ 1}. ae 
οἱ + sin x o 1 + sinx + cosx* ο Ssinx + cos x 


9 Find the following integrals: 


10 Evaluate: 


f dx - 1 x dx . : dx 
(7) Ϊ αι 1)(x? + 1)’ (21) Ι (x? + 1)?’ ve) | (x2 + 1)?’ 


; dx an 
(1) | G+ 132? (v) \ sin” θ cos* 6 dé; 
0 


4n 
(v7) | sin* θ cos® 6 dé; (vit ) | x* In x dx; 
0 1 


vit) | V4 -- ἢ ae |" = 


᾿ 2 + cos 0’ 
ἀπ cos -- sin 0 
*) | oan” 


(x2 . sa (put ¢ = tan @); 
_i, 3sin? 6 + cos? 0 Dubie tan); 


(x22 ) Pe) (put x = 2 sin* θ + 4 cos* 6). 
— xX 


9.8 Differential equations 


We have seen that the process of integration is the reverse of differentiation
and so requires the discovery of a function having the given rate of change.
Frequently, however, we do not know either the function or its rate of
change, but merely how they are related.
When, for instance in 9.2, we were investigating the cooling of bathwater,
we simply knew that after time $t$ the excess temperature $\theta$ and its rate of
decrease $-\dfrac{d\theta}{dt}$ were connected by the equation
\[
\frac{d\theta}{dt} = -k\theta \tag{1}
\]


and in the same section it was mentioned that the current x flowing in a 
particular electrical circuit obeyed the equation 
dx dx 


Ae are (2) 


Similarly the rate of growth of the mass m of an organism frequently 
depends on the equation 


ἀπι ἐπ, () 


and the rate of spread of a disease in a population, of which a fraction $x$ is
infected, is often governed by the equation
\[
\frac{dx}{dt} = kx(1 - x). \tag{4}
\]
Any such equation which connects the rates of change of variables with
their values is called a differential equation. Equations (1), (3), and (4) are
differential equations of the first order and equation (2) is of the second order,
so called from the degree of their highest derivatives.
In particular, such a first-order equation as
\[
\frac{dy}{dx} = x^2
\]
immediately leads to
\[
y = \tfrac13 x^3 + c.
\]
This is called its general solution; and by giving $c$ a particular value we can
obtain particular solutions, such as
\[
y = \tfrac13 x^3 + 2 \quad\text{or}\quad y = \tfrac13 x^3 - 4.
\]


Graphically, we see that the general solution is represented by a family of 
curves, and that each particular solution is an individual member of this family. 


Two further examples will illustrate this relationship. 


Example 1

If $x^2 + y^2 = a^2$,
form a differential equation independent of $a$.
As $a$ varies, the equation represents a family of circles with centre at the
origin O. Differentiating with respect to $x$,
\[
2x + 2y\frac{dy}{dx} = 0 \quad\Rightarrow\quad x + y\frac{dy}{dx} = 0.
\]
This is the required differential equation, which can be written as
\[
\frac{y}{x} \times \frac{dy}{dx} = -1
\]
and since the gradient of OP, where P is the point (x, y), is y/x, this equation
is simply expressing the fact that the tangent to each circle at every point is
always perpendicular to the radius.


Example 2

A point P is moving along a straight line and after time $t$ its displacement $x$ is
given by
\[
x = A\sin 2t + B\cos 2t.
\]
Form a differential equation which is independent of A and B.
As we here need to eliminate two arbitrary constants, we shall have to
differentiate twice and form a second order differential equation.
\[
\begin{aligned}
\text{Now}\quad x &= A\sin 2t + B\cos 2t \\
\Rightarrow\quad \frac{dx}{dt} &= 2A\cos 2t - 2B\sin 2t \\
\Rightarrow\quad \frac{d^2x}{dt^2} &= -4A\sin 2t - 4B\cos 2t \\
\Rightarrow\quad \frac{d^2x}{dt^2} &= -4x.
\end{aligned}
\]
This, therefore, is the required equation, showing that for all values of
A and B the acceleration is always $-4x$, and we see that
\[
x = A\sin 2t + B\cos 2t
\]
is a general solution of this equation.
More frequently, however, we start with the differential equation and
need to find appropriate solutions. In this case, for instance, the equation
\[
\frac{d^2x}{dt^2} = -4x
\]
might have arisen from investigation of a buoy bobbing up and down in
the sea, and our task would generally be to show that this leads to a solution
of the form
\[
x = A\sin 2t + B\cos 2t,
\]
representing a periodic oscillation.


Exercise 9.8a 


1 Verify that the following differential equations have the solutions indi- 
cated (where A, B, ¢ are arbitrary constants) : 
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d 
(7) a 53 LS As 
di 
d 
(22) ΠΑ = Dy: y = Ax? 
dx 
d 
(Gi) y2 =x; Pe —y =A 
dx 
dx l 
—= -_-_ =—— Ι] -- A = --.-..ΆΚὅς----. 
τ Oe ea gree εἰ 
d? d 
(v) τς τ ὅς +2 το; κα 44 Ὁ Be™. 
d? d 
(v2 ) a ἢ, + 5x = Ι: x = Ae ‘sin (2¢+ €) + 1. 


2 Find differential equations which are satisfied at all points of all 
members of the following families of curves (where A, B, a,c, etc. define the 
individual members of the families) : 


(1) xy = 6; (1) y= 4 δ; (111) ψ΄ = ax; 
(1υ) χ' -- γ' -- α΄; (v) y = Ax? + Bx; (wi) x =Aeé+ Β ε΄. 


We have now begun to see the relationship between a differential
equation and its solutions. As with algebraic equations, the task of the
mathematician is usually twofold:

(i) to express a given situation (whether mechanical, electrical, chemical,
biological, economic, or whatever) in a suitable mathematical form
(usually called a model), which very frequently leads to a differential
equation; and,

(ii) to find the appropriate solution of such a differential equation. Very
often this has to be done numerically, but we shall begin to discuss the
problem by taking three special types of differential equation for which it is
usually possible to find a general solution.


First order equations with separable variables 


A typical first order equation is
\[
\frac{dy}{dx} = f(x, y)
\]
and if f(x, y) is defined at every point of the x-y plane, it follows that at
each point a corresponding gradient and direction are known, just as in a
magnetic field plotted by a compass needle.




Example 3
\[
\frac{dy}{dx} = \frac{x}{y}
\]
The values of the gradient $\dfrac{dy}{dx}$ can be indicated by a table:

              x = 1    x = 2    x = 3
    y =  3     1/3      2/3       1
    y =  2     1/2       1       3/2
    y =  1      1        2        3
    y = -1     -1       -2       -3
    y = -2    -1/2      -1      -3/2
    y = -3    -1/3     -2/3      -1

These can then be plotted at the different points of the plane and already
we can begin to see the family of solutions, looking like the lines of force of a
magnetic field.
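A table of gradients of this kind is easily generated. The sketch below is illustrative: the grid of points (x = 1, 2, 3 and y = ±1, ±2, ±3) is a choice made here, and the function name `gradient` is not from the text. Exact fractions are used so that the entries appear as they would in a hand-drawn table.

```python
from fractions import Fraction

def gradient(x, y):
    """Gradient dy/dx = x / y of the solution curve through (x, y), y != 0."""
    return Fraction(x, y)

xs = [1, 2, 3]
ys = [3, 2, 1, -1, -2, -3]
table = {y: [gradient(x, y) for x in xs] for y in ys}

print("         " + "  ".join(f"x={x:>3}" for x in xs))
for y in ys:
    print(f"y = {y:>2}:  " + "  ".join(f"{str(g):>5}" for g in table[y]))
```

Drawing a short line segment of the tabulated gradient at each grid point gives the 'compass-needle' picture described above.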


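But we can also approach the problem quite differently. For the equation
\[
\frac{dy}{dx} = \frac{x}{y} \quad\text{can be written as}\quad y\frac{dy}{dx} = x
\]
\[
\begin{aligned}
&\Rightarrow\quad \int y\frac{dy}{dx}\,dx = \int x\,dx \\
&\Rightarrow\quad \int y\,dy = \int x\,dx \\
&\Rightarrow\quad \tfrac12 y^2 = \tfrac12 x^2 + c \\
&\Rightarrow\quad y^2 = x^2 + 2c.
\end{aligned}
\]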


This, therefore, is the general solution of the given equation; and we can 
obtain different particular solutions by giving different values to the constant c. 

Graphically, we see that each particular solution is a certain curve (in 
fact a rectangular hyperbola, each one having two branches), and the 
general solution represents a family of such hyperbolas. Finally, the sketch 
of these hyperbolas should be compared with our original ‘compass- 
needle’ figure. 


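[Figure: the family of rectangular hyperbolas $y^2 = x^2 + 2c$, with branches labelled c = -2, -1, 0, 1, 2.]

\[
\begin{aligned}
c = 0 &\;\Rightarrow\; y^2 = x^2 \\
c = 1 &\;\Rightarrow\; y^2 = x^2 + 2 \\
c = 2 &\;\Rightarrow\; y^2 = x^2 + 4 \\
c = -1 &\;\Rightarrow\; y^2 = x^2 - 2 \\
c = -2 &\;\Rightarrow\; y^2 = x^2 - 4, \text{ etc.}
\end{aligned}
\]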


In the above solution we could have proceeded a little more rapidly by
saying that
\[
\frac{dy}{dx} = \frac{x}{y} \tag{1}
\]
can be written
\[
y\,dy = x\,dx \tag{1a}
\]
and then as
\[
\int y\,dy = \int x\,dx \tag{2}
\]
\[
\Rightarrow\quad \tfrac12 y^2 = \tfrac12 x^2 + c \quad\Rightarrow\quad y^2 = x^2 + 2c.
\]
Although the statement (1a) has not (as yet) any proper meaning, we
have already justified the argument from (1) to (2), and (1a) clearly serves
as a very useful intermediary. This is usually known as separating the variables:
any first order equation which can similarly be expressed as
\[
f(y)\,dy = g(x)\,dx
\]
is said to have separable variables, and its general solution can be written as
\[
\int f(y)\,dy = \int g(x)\,dx.
\]

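Example 4

Solve the differential equation
\[
\frac{dy}{dx} = -\frac{y}{x}.
\]
\[
\begin{aligned}
\frac{dy}{y} &= -\frac{dx}{x} \\
\Rightarrow\quad \ln y &= -\ln x + c \\
\Rightarrow\quad \ln xy &= c \\
\Rightarrow\quad xy &= a \quad (\text{where } a = e^c).
\end{aligned}
\]
The solution-curves are therefore:

[Figure: the family of rectangular hyperbolas xy = a, with branches labelled a = ±1, ±2, ±3, together with the axes (a = 0).]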


Lastly, we see that there is a relationship between the two examples we
have chosen,
\[
\frac{dy}{dx} = \frac{x}{y} \quad\text{and}\quad \frac{dy}{dx} = -\frac{y}{x}.
\]
Since $\dfrac{x}{y} \times \left(-\dfrac{y}{x}\right) = -1$, it is clear that the directions at any point of the
solution-curves of the two equations must be perpendicular, and this is
very clearly seen when the figures are superimposed:


Such pairs of families of curves which are perpendicular everywhere they 
meet are called orthogonal families and are outstandingly important not 
only as lunes of forces and their corresponding equipotentials, but throughout 
applied mathematics. There is indeed a particularly simple illustration of 
them on a map as the contours and the lines of greatest slope of a hillside, and 
this last diagram can be regarded as the map of a saddle-point with rainfall 
streaming from the hillsides in the North-East and South-West into the 
valleys in the North-West and South-East: 
Example 5

A stone falls vertically from rest against an air resistance which is proportional to its velocity $v$, so that its acceleration is $g - kv$, where $g$ and $k$ are constants.

(i) Write down differential equations connecting $v$ with $t$, the time; and $v$ with $x$, the distance fallen.

(ii) Find expressions for $v$ in terms of $t$ and $x$ in terms of $t$.

(iii) Find the terminal velocity, as $t \to \infty$.


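(i) The downwards acceleration

$$= \frac{dv}{dt} = \frac{dv}{dx}\,\frac{dx}{dt} = v\frac{dv}{dx}.$$

So
$$\frac{dv}{dt} = g - kv \qquad (1)$$
and
$$v\frac{dv}{dx} = g - kv. \qquad (2)$$

(ii) From (1), we see that

$$\int \frac{dv}{g - kv} = \int dt \quad\Rightarrow\quad -\frac{1}{k}\ln(g - kv) = t + c.$$

But when $t = 0$, $v = 0$; so $c = -\dfrac{1}{k}\ln g$.

Hence
$$-t = \frac{1}{k}\ln(g - kv) - \frac{1}{k}\ln g = \frac{1}{k}\ln\frac{g - kv}{g}$$

$$\Rightarrow\quad \frac{g - kv}{g} = e^{-kt}$$

$$\Rightarrow\quad v = \frac{g}{k}\left(1 - e^{-kt}\right). \qquad (3)$$

Hence
$$\frac{dx}{dt} = \frac{g}{k}\left(1 - e^{-kt}\right) \quad\Rightarrow\quad x = \frac{g}{k}\left(t + \frac{1}{k}e^{-kt}\right) + d.$$

But when $t = 0$, $x = 0$; so $d = -\dfrac{g}{k^2}$.

Hence
$$x = \frac{gt}{k} + \frac{g}{k^2}\left(e^{-kt} - 1\right). \qquad (4)$$

(iii) Finally, we see from (3) that as $t \to \infty$, $v \to g/k$, which is therefore
called its terminal velocity. (And the same value can be found more simply
from equation (1) if we suppose that the acceleration eventually tends to
zero.)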
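The results of Example 5 can be checked numerically. The following is only a sketch: the values g = 9.8 and k = 0.5 and the function names are illustrative assumptions, not part of the example. It compares the closed-form velocity $v = (g/k)(1 - e^{-kt})$ with a step-by-step (Euler) integration of $dv/dt = g - kv$.

```python
import math

def velocity(t, g=9.8, k=0.5):
    """Closed-form velocity v = (g/k)(1 - e^{-kt}) for a fall from rest."""
    return (g / k) * (1.0 - math.exp(-k * t))

def distance(t, g=9.8, k=0.5):
    """Closed-form distance x = gt/k + (g/k^2)(e^{-kt} - 1)."""
    return g * t / k + (g / k**2) * (math.exp(-k * t) - 1.0)

def euler_velocity(t_end, g=9.8, k=0.5, steps=100_000):
    """Integrate dv/dt = g - kv from v(0) = 0 by Euler's method."""
    dt = t_end / steps
    v = 0.0
    for _ in range(steps):
        v += (g - k * v) * dt
    return v

if __name__ == "__main__":
    for t in (1.0, 5.0, 20.0):
        print(t, velocity(t), euler_velocity(t), distance(t))
    # The velocity approaches g/k (the terminal velocity) as t grows.
    print("g/k =", 9.8 / 0.5)
```

For large t the printed velocities settle at g/k, in agreement with the argument from zero acceleration.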
Example 6

The rate at which a tree takes in its ‘food’ is proportional to the area of its
rooting surface. The food is used
(i) to maintain the tree: this requires an amount proportional to its
volume, and
(ii) for it to grow: this requires an amount proportional to the rate of
increase of volume.

If the complete tree, including roots, retains a constant shape, show that
its height $h$ satisfies a differential equation of the form

$$\frac{dh}{dt} + Ah = B$$

and that the tree will tend to a maximum height.

A tree is planted when it is 2 m high and its initial rate of growth in 
height is 0.3 m per year. Its final height is 100 m. How long does it take to 
reach a height of 50 m? (M.E.I.) 


This example shows how a so-called ‘mathematical model’ can be set 
up to describe a given situation; whether or not the model is valid can then 
be tested from the accuracy of its predictions. 

Suppose that after time $t$ the volume of the tree is $V$ and the area of its
rooting surface is $S$. Then, since the tree retains a constant shape,

$$S = ah^2, \quad V = bh^3 \quad (\text{where } a, b \text{ are constants}).$$

Also the rate at which it takes in its food is $kS$.

$$\begin{aligned}
kS &= cV + d\,\frac{dV}{dt} \quad (\text{where } k, c, d \text{ are constants}) \\
\Rightarrow\quad kah^2 &= cbh^3 + d \times 3bh^2\,\frac{dh}{dt} \\
\Rightarrow\quad ka &= bch + 3bd\,\frac{dh}{dt} \\
\Rightarrow\quad \frac{dh}{dt} + Ah &= B. \qquad (1)
\end{aligned}$$

When $t = 0$, $h = 2$ and $\dfrac{dh}{dt} = 0.3$, so

$$0.3 + 2A = B. \qquad (2)$$

Also, as the tree nears its final height, $h \to 100$ and $\dfrac{dh}{dt} \to 0$; so

$$100A = B. \qquad (3)$$

From (2) and (3), $A = 0.3/98$ and $B = 30/98$.

Hence
$$\begin{aligned}
\frac{dh}{dt} + \frac{0.3}{98}h &= \frac{30}{98} \\
\Rightarrow\quad 98\,\frac{dh}{dt} &= 0.3(100 - h) \\
\Rightarrow\quad \frac{98\,dh}{100 - h} &= 0.3\,dt \\
\Rightarrow\quad -98\ln(100 - h) &= 0.3t + C.
\end{aligned}$$

But when $t = 0$, $h = 2$; so

$$-98\ln 98 = C$$

$$\Rightarrow\quad 0.3t = 98\ln\frac{98}{100 - h}. \qquad (4)$$

Putting $h = 50$,

$$0.3t = 98\ln\frac{98}{50} \quad\Rightarrow\quad t = \frac{98}{0.3}\ln 1.96 = \frac{98 \times 0.6729}{0.3} \approx 220.$$

So the tree would take about 220 years to grow to a height of 50 m.
Moreover, from (4) we can obtain a more general formula for its height $h$
after $t$ years.

For
$$\begin{aligned}
\ln\frac{100 - h}{98} &= -\frac{0.3t}{98} \\
\Rightarrow\quad \frac{100 - h}{98} &= e^{-0.3t/98} \\
\Rightarrow\quad h &= 100 - 98\,e^{-0.3t/98} \\
\Rightarrow\quad h &\approx 100 - 98\,e^{-0.00306t}.
\end{aligned}$$
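The arithmetic of Example 6 is easily reproduced. The sketch below assumes only the values found above, A = 0.3/98 and B = 30/98; the function names are illustrative.

```python
import math

A = 0.3 / 98   # from 0.3 + 2A = B and 100A = B
B = 30 / 98

def height(t):
    """Height after t years: h = 100 - 98 e^{-0.3 t / 98}."""
    return 100.0 - 98.0 * math.exp(-A * t)

def time_to_reach(h):
    """Years needed to reach height h: t = (98/0.3) ln(98 / (100 - h))."""
    return (98.0 / 0.3) * math.log(98.0 / (100.0 - h))

if __name__ == "__main__":
    print(time_to_reach(50.0))          # roughly 219.8 years
    print(height(time_to_reach(50.0)))  # recovers 50 m
    print(B - A * 2.0)                  # initial growth rate, 0.3 m per year
```

The computed time is about 219.8 years, which rounds to the 220 quoted in the text.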


9.8 DIFFERENTIAL EQUATIONS 
Exercise 9.8b 
1 For each of the following equations, 


(a) illustrate the equation by means of a compass-needle diagram;
(b) find and illustrate its general solution:


d d d 
(ἢ) τῇ τ ; Gi) τῇ τ αν; (iti) τ = 2x; 
dy ᾿ ὸ dy y dy 7 2x 
(v0) a ἃ Ὁ dx χ᾽ 7) dx sy 


2 For each of the following families of curves, 


(a) eliminate the parameter $c$ to form a differential equation which represents the family (i.e., is true at every point of every curve);
(b) illustrate by means of a compass-needle diagram;
(c) find the differential equation representing the orthogonal family;
(d) find and illustrate the general solution of this equation:
(1) y = x; (1) y=x+eo; Gn y= ex 

ΒΨ ( 


(Ὁ) y = χα τ ὃ; (v) y=ce*; ἢ y= 


3 (1) Solve the differential equation 


dy 
Ay = tan y cot x, 
given that, when x = $7, y = in. (0.c.) 
(11) Find the solution of the differential equation 
x τι = sin y 
x 
(for x > 0) which passes through the point (2, 57). (S.M.P.) 


4 The size $S$ of a population at time $t$ satisfies approximately the differential equation $\dfrac{dS}{dt} = kS$, where $k$ is a constant. Integrate this equation to
find $S$ as a function of $t$.


The population numbered 32 000 in the year 1900 and had increased 
to 48 000 by 1970. Estimate what its size will be (correct to the nearest 
1 000) in the year 2000. (S.M.P.) 


5 If a ship’s mooring rope is wrapped round a rough bollard, with coefficient of friction $\mu$, and is on the point of slipping, it can be shown that
its tension $T$ an angle $\theta$ from its taut end is such that

$$\frac{dT}{d\theta} = -\mu T.$$

Deduce that
$$T = T_0\,e^{-\mu\theta},$$

where $T_0$ is the tension from the ship. If $\mu = \tfrac12$, show that one complete
turn will reduce this tension in the ratio $1 : e^{\pi}$ (or approximately 1 : 23).


6 If the rate of erosion of mass of a spherical pebble is proportional to its
surface area and it is half-eroded after 100 years, how much longer will it 
last? 


7 Show that the acceleration $\dfrac{dv}{dt}$ of a particle moving in a straight line
can be written, in terms of its velocity $v$ and its displacement $x$ from a point
of the line, in the form $v\,\dfrac{dv}{dx}$.

At a distance $x$ km from the centre of the Earth the gravitational acceleration in km s$^{-2}$ is given by the formula $c/x^2$ where $c = 4 \times 10^5$. If a lunar
vehicle 10 000 km from the centre of the Earth is moving directly away
from it at a speed of 10 km s$^{-1}$, at what distance will its speed be half that
value? (S.M.P.)


8 A rumour is spreading through a large city at a rate which is proportional to the product of the numbers of those who have heard it and of
those who have not heard it, so that if $x$ is the fraction who have heard it
after time $t$,

$$\frac{dx}{dt} = kx(1 - x).$$

If initially a fraction $c$ of the population has heard the rumour, deduce
that

$$x = \frac{c}{c + (1 - c)\,e^{-kt}}.$$

If 10% have heard the rumour at noon and another 10% by 3.00 p.m.,
find $x$ as a function of $t$. What further proportion would you expect to have
heard it by 6.00 p.m.?


9 The inhabitants of a country are in two racial groups which do not
mix reproductively. Each group’s population increases at a rate proportional to the population. At a certain stage 80% are of type A, type B will
double in 50 years, but the whole population will take 100 years to double.
How long will it take group A to double? How long will it take before the
two groups are equal in numbers? (M.E.I.)


10 Heat is supplied to an object at a constant rate $Wa$ joules s$^{-1}$; heat
is lost to its surroundings at a rate of $bW\theta$ joules s$^{-1}$, where $W$ is the number
of joules needed to raise the temperature of the object by 1 °C, $\theta$ °C is
the difference in temperature between the object and its surroundings,
and $a$, $b$ are constants. Obtain a differential equation to connect $\theta$ with
the time $t$.

Use this mathematical model to solve the following problem. An electric
kettle, open to a room temperature of 20 °C, reaches 70 °C in 3 min after
being switched on, and reaches boiling-point in a further 3 min. Find the
time taken to reach boiling-point from 20 °C if it is completely heat insulated.

What deficiencies has the mathematical model in representing the actual
physical situation? (M.E.I.)


11 The food energy taken in by a human body goes partly to increase the
mass and partly to fulfil the requirements of the body; these daily requirements are taken to be proportional to the mass $M$. The rate of increase of
mass is proportional to the number of joules available for this. Write down
a differential equation connecting $M$, the time $t$ and the daily intake of
joules $f(t)$.

A man’s mass is 100 kg; if he took in no energy he would reduce his
weight by 10% in 10 days. How long would it take him to reduce by this
amount if, instead, he took in exactly half the number of joules needed to
keep his mass constant at 100 kg? (M.E.I.)


12 A stone is thrown vertically upwards with velocity $u$, and air resistance
is proportional to the square of its speed ($= kv^2$ per unit mass).
Find:
(i) the differential equation connecting its velocity $v$ and the height $x$ it
has then reached;
(ii) $x$ in terms of $v$;
(iii) its maximum height;
(iv) its terminal (or limiting) velocity.


13 A flask contains 10 dm$^3$ of a solution of water and a thoroughly dissolved chemical. Each minute, 2 dm$^3$ of water flow into the flask and
3 dm$^3$ of solution flow out of the flask. Write down an expression for the
number of kilograms of chemical per cubic decimetre in the solution after
$t$ min. If there are $x$ kg of chemical in the flask at this time, show that

$$\frac{dx}{dt} = -\frac{3x}{10 - t}.$$

Find the general solution of this equation. If there were 1 kg of chemical
in the flask at time $t = 0$, find the mass of chemical in the flask after 5 min.
(A.E.B.)
14 The rate of increase of thickness of ice on a pond is inversely proportional to the thickness of ice already present. It is known that, when the
thickness of the ice is $x$ cm and the temperature of the air above the ice is
$-\alpha$ °C, the rate is $\alpha/(14\,400\,x)$ cm s$^{-1}$. Form an appropriate differential
equation, and hence show that, if the air temperature is $-10$ °C, the time
taken for the thickness to increase from 5 cm to 6 cm is a little more than
2 h. (C.)
Linear equations of first order

A particularly important class of differential equations consists of those
which are linear, where $y$ and its derivatives occur only in linear combinations even though their coefficients may be functions of $x$. So a linear
equation of the first order can be written as

$$f(x)\,\frac{dy}{dx} + g(x)\,y = h(x). \qquad (1)$$
Example 7

$$x^2\,\frac{dy}{dx} + 2xy = 1$$

We quickly notice that this is a particularly convenient equation since it
can be written as

$$\frac{d}{dx}(x^2 y) = 1;$$

and, as the left-hand side is an exact derivative, the original equation is
also called exact.

Proceeding with the solution, we see that

$$x^2 y = x + A \quad\Rightarrow\quad y = \frac{1}{x} + \frac{A}{x^2}.$$


If we now return to the more general linear equation (1) we see that, on dividing by the coefficient of $dy/dx$, it
can be written as

$$\frac{dy}{dx} + Py = Q \qquad (2)$$

where $P$ and $Q$ are functions of $x$.

This is usually taken as the standard form of the first-order linear equa- 
tion. Unfortunately (unlike Example 7) it is not usually exact, so we now 
investigate whether there is any way in which it can be made into an exact 
equation. 

If we multiply each side of (2) by another function, denoted by $R$, it
becomes

$$R\,\frac{dy}{dx} + PRy = QR,$$

and we see that the left-hand side would be an exact derivative if only it
could be written as $\dfrac{d}{dx}(Ry)$.

But
$$\frac{d}{dx}(Ry) = R\,\frac{dy}{dx} + \frac{dR}{dx}\,y.$$

So $R\,\dfrac{dy}{dx} + PRy$ is an exact derivative provided that

$$\begin{aligned}
\frac{dR}{dx} &= PR \\
\Leftarrow\quad \int \frac{dR}{R} &= \int P\,dx \\
\Leftarrow\quad \ln R &= \int P\,dx \\
\Leftarrow\quad R &= e^{\int P\,dx}.
\end{aligned}$$
As this function enables us to transform (2) into an exact equation,
$e^{\int P\,dx}$ is usually known as an integrating factor. By this means the equation
becomes

$$e^{\int P\,dx}\,\frac{dy}{dx} + P\,e^{\int P\,dx}\,y = Q\,e^{\int P\,dx}$$

$$\Rightarrow\quad \frac{d}{dx}\left(y\,e^{\int P\,dx}\right) = Q\,e^{\int P\,dx},$$

and the problem is reduced to one of ordinary integration.
Two further examples will make this clear: 


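Example 8

$$\frac{dy}{dx} = x + y.$$

Unlike Example 7, this is not an exact equation. But it can be written as

$$\frac{dy}{dx} - y = x,$$

so that a possible integrating factor is

$$e^{\int P\,dx} = e^{\int -1\,dx} = e^{-x}.$$

Hence
$$\begin{aligned}
e^{-x}\,\frac{dy}{dx} - e^{-x}\,y &= x\,e^{-x} \\
\Rightarrow\quad \frac{d}{dx}\left(e^{-x} y\right) &= x\,e^{-x} \\
\Rightarrow\quad e^{-x} y &= \int x\,e^{-x}\,dx = -x\,e^{-x} - e^{-x} + A \\
\Rightarrow\quad y &= A\,e^{x} - x - 1.
\end{aligned}$$

[Figure: solution curves $y = Ae^x - x - 1$ for several values of $A$, including $A = 3, 2, 0, -1$.]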
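The solution of Example 8 can be checked without any calculus. The sketch below assumes the equation is $dy/dx = x + y$ with solution $y = Ae^x - x - 1$; the Runge-Kutta routine and the function names are illustrative choices, not part of the text.

```python
import math

def rk4(f, x0, y0, x_end, steps=1000):
    """Classical fourth-order Runge-Kutta integration of dy/dx = f(x, y)."""
    h = (x_end - x0) / steps
    x, y = x0, y0
    for _ in range(steps):
        k1 = f(x, y)
        k2 = f(x + h / 2, y + h * k1 / 2)
        k3 = f(x + h / 2, y + h * k2 / 2)
        k4 = f(x + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        x += h
    return y

def exact(x, A):
    """General solution y = A e^x - x - 1 obtained with the integrating factor e^{-x}."""
    return A * math.exp(x) - x - 1.0

if __name__ == "__main__":
    y0 = 1.0
    A = y0 + 1.0   # since y(0) = A - 1
    numeric = rk4(lambda x, y: x + y, 0.0, y0, 1.0)
    print(numeric, exact(1.0, A))
```

The two printed values agree to many decimal places, which supports the constant $A$ being fixed by the starting value $y(0) = A - 1$.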
Example 9

$$\sin x\,\frac{dy}{dx} + 2y\cos x = 1.$$

In standard form, this becomes

$$\frac{dy}{dx} + (2\cot x)\,y = \operatorname{cosec} x$$

and the necessary integrating factor is

$$e^{\int 2\cot x\,dx} = e^{2\ln\sin x} = e^{\ln\sin^2 x} = \sin^2 x.$$

So the equation becomes

$$\begin{aligned}
(\sin^2 x)\,\frac{dy}{dx} + (2\sin x\cos x)\,y &= \sin x \\
\Rightarrow\quad \frac{d}{dx}\left(y\sin^2 x\right) &= \sin x \\
\Rightarrow\quad y\sin^2 x &= -\cos x + A \\
\Rightarrow\quad y &= \frac{A - \cos x}{\sin^2 x}.
\end{aligned}$$


Exercise 9.8c 


1 Find the general solutions of the following equations: 


d 

@) Baxny Gi) eax Ἐν, 
dx 
d d 

(111) x— — Qy = (x — 2) e; (iv) τι Ἐν =x 


2 Find the solutions of the following equations such that y = 1 when 
x = 0: 


d 
(δ l—-x)2-we=x; (i) (1 + x) = aly + 1). 


3 The current $i$ in a certain electric circuit satisfies the equation

$$\frac{di}{dt} + 2i = \sin t.$$

If it is known that $i = 0$ when $t = 0$, find $i$ in terms of $t$. To what does
this approximate as $t \to \infty$?


4 A radioactive substance P decays and changes (without loss of mass)
into a substance Q, which itself similarly changes into a third substance R.
R suffers no further change. The masses of P, Q, and R present at time $t$ are
given by $p$, $q$, and $r$ respectively. The rates of change are such that

$$\frac{dp}{dt} = -2p \quad\text{and}\quad \frac{dr}{dt} = q. \qquad (1)$$

Show that

$$\frac{dq}{dt} = 2p - q.$$

Initially (at time $t = 0$) there is 1 g of substance P and none of substance Q. Integrate equation (1) and hence show that $q$ satisfies the differential equation

$$\frac{dq}{dt} + q = 2e^{-2t}. \qquad (2)$$

Show that (2) may be written in the form

$$\frac{d}{dt}\left(q\,e^{t}\right) = 2e^{-t},$$

and integrate to find $q$ as a function of $t$. Hence prove that, at any subsequent time, there is never more than $\tfrac12$ g of Q present. (S.M.P.)
5 [Figure: two circuits, (i) and (ii), each driven by an alternating potential $E_0 \sin \omega t$.]

In each of the above circuits an alternating potential $E_0 \sin \omega t$ is applied
and the current $i$ satisfies the following equations (respectively), where
$R$, $L$, $C$ are constants, known as resistance, inductance, and capacitance.

$$\text{(i)}\;\; L\,\frac{di}{dt} + Ri = E_0 \sin \omega t; \qquad \text{(ii)}\;\; R\,\frac{di}{dt} + \frac{i}{C} = \omega E_0 \cos \omega t.$$

In each case, find:
(a) the appropriate integrating factor;
(b) the general solution for $i$, as a function of $t$;
(c) the ultimate ‘steady’ alternating current (when $t \to \infty$ and any
‘transient’ terms have faded away).

[It may be assumed (or proved by repeated integration by parts) that

$$\int e^{kt}\sin \omega t\,dt = \frac{e^{kt}}{k^2 + \omega^2}\,(k \sin \omega t - \omega \cos \omega t)$$

and
$$\int e^{kt}\cos \omega t\,dt = \frac{e^{kt}}{k^2 + \omega^2}\,(\omega \sin \omega t + k \cos \omega t).]$$


Linear equations with constant coefficients 


A particularly important (and fortunately simple) linear differential
equation is one in which the coefficients of $y$ and its derivatives are all
constants. We shall consider the second-order equation

$$a\,\frac{d^2y}{dx^2} + b\,\frac{dy}{dx} + cy = f(x) \qquad (1)$$

where $a$, $b$, $c$ are constants.

The solution of such equations can be divided into two parts:

(a) finding a general solution of the associated equation:

$$a\,\frac{d^2y}{dx^2} + b\,\frac{dy}{dx} + cy = 0 \qquad (2)$$

in which $f(x)$ has been replaced by zero. This solution is known as the
complementary function (c.f.).
(b) finding a particular solution of the original equation (1). This is called a
particular integral (p.i.).

These processes will be clear when we take particular examples, but the 
general method can be summarized as follows: 


(a) The complementary function
To find a general solution of the associated equation

$$a\,\frac{d^2y}{dx^2} + b\,\frac{dy}{dx} + cy = 0, \qquad (2)$$

we put $y = e^{mx}$ and observe that this is a solution provided that

$$am^2 e^{mx} + bm\,e^{mx} + c\,e^{mx} = 0$$

$$\Leftrightarrow\quad am^2 + bm + c = 0.$$

So if this equation† has roots $\alpha$ and $\beta$, it follows that $e^{\alpha x}$ and $e^{\beta x}$ are both
solutions of equation (2).

Moreover,
if $u = Ae^{\alpha x} + Be^{\beta x}$ (where $A$, $B$ are any constants),

then
$$\frac{du}{dx} = A\alpha\,e^{\alpha x} + B\beta\,e^{\beta x}$$

and
$$\frac{d^2u}{dx^2} = A\alpha^2 e^{\alpha x} + B\beta^2 e^{\beta x}.$$


† The very important case when $am^2 + bm + c = 0$ does not have any real roots will be
considered in 10.5. Meanwhile, we see that if it has identical roots $\alpha$, $\alpha$, then

$$u = Ae^{\alpha x} + Be^{\alpha x} = (A + B)\,e^{\alpha x},$$

which effectively contains only one arbitrary constant $A + B$, rather than two independent
constants $A$ and $B$. But in this case it can quite easily be shown that $x\,e^{\alpha x}$ is another solution,
so that

$$u = Ae^{\alpha x} + Bx\,e^{\alpha x}$$

is also a solution, thus restoring two independent constants.
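Hence
$$\begin{aligned}
a\,\frac{d^2u}{dx^2} + b\,\frac{du}{dx} + cu
&= a\left(A\alpha^2 e^{\alpha x} + B\beta^2 e^{\beta x}\right) + b\left(A\alpha\,e^{\alpha x} + B\beta\,e^{\beta x}\right) + c\left(Ae^{\alpha x} + Be^{\beta x}\right) \\
&= A\left(a\alpha^2 + b\alpha + c\right)e^{\alpha x} + B\left(a\beta^2 + b\beta + c\right)e^{\beta x} \\
&= 0 + 0 = 0.
\end{aligned}$$

So $u = Ae^{\alpha x} + Be^{\beta x}$ is also a solution of (2). This is therefore the complementary function.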


(b) A particular integral 


The discovery of a particular solution of the original equation is frequently a 
matter of trial and error, and we shall restrict ourselves to the simplest 
cases. But once we have found such a p.i. we can immediately combine it 
with the c.f. to provide a general solution of the original equation: 


For suppose that $v(x)$ is a particular integral.

Then
$$a\,\frac{d^2v}{dx^2} + b\,\frac{dv}{dx} + cv = f(x).$$

But the complementary function $u(x)$ was such that

$$a\,\frac{d^2u}{dx^2} + b\,\frac{du}{dx} + cu = 0.$$

Hence
$$a\,\frac{d^2}{dx^2}(u + v) + b\,\frac{d}{dx}(u + v) + c(u + v) = f(x)$$

and we see that $u(x) + v(x)$ is a general solution of the original equation.
So for such a linear equation,

general solution = complementary function + particular integral


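Example 10

Find a general solution of the differential equation

$$\frac{d^2y}{dx^2} + \frac{dy}{dx} - 6y = 6x + 1.$$

(a) Complementary function

The associated equation is

$$\frac{d^2y}{dx^2} + \frac{dy}{dx} - 6y = 0$$

and $y = e^{mx}$ will be a solution provided that

$$\begin{aligned}
m^2 e^{mx} + m\,e^{mx} - 6\,e^{mx} &= 0 \\
\Leftrightarrow\quad m^2 + m - 6 &= 0 \\
\Leftrightarrow\quad (m - 2)(m + 3) &= 0 \\
\Leftrightarrow\quad m &= 2 \text{ or } -3.
\end{aligned}$$

So the c.f. is

$$y = Ae^{2x} + Be^{-3x}.$$

(b) Particular integral

If we seek for $y$ as a function of $x$ such that

$$\frac{d^2y}{dx^2} + \frac{dy}{dx} - 6y = 6x + 1$$

we naturally first look amongst simple linear functions of the form

$$y = ax + b.$$

Then $\dfrac{dy}{dx} = a$ and $\dfrac{d^2y}{dx^2} = 0$.

So
$$\begin{aligned}
\frac{d^2y}{dx^2} + \frac{dy}{dx} - 6y &= 6x + 1 \\
\Leftarrow\quad 0 + a - 6(ax + b) &= 6x + 1 \\
\Leftarrow\quad a = -1 \;\text{ and }\; a - 6b &= 1 \;\Rightarrow\; b = -\tfrac13.
\end{aligned}$$

So a p.i. is $y = -x - \tfrac13$

and a general solution of the original equation is

$$y = Ae^{2x} + Be^{-3x} - x - \tfrac13.$$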
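A quick numerical check of Example 10 is sketched below. It assumes that the right-hand side of the equation is $6x + 1$ (which is what the particular integral $-x - \tfrac13$ requires); the function names and the use of finite differences are illustrative choices only.

```python
import math

def auxiliary_roots(a, b, c):
    """Real roots of a m^2 + b m + c = 0 (complex roots are outside this sketch)."""
    disc = b * b - 4 * a * c
    if disc < 0:
        raise ValueError("no real roots")
    r = math.sqrt(disc)
    return ((-b + r) / (2 * a), (-b - r) / (2 * a))

def general_solution(x, A, B):
    """c.f. + p.i. for y'' + y' - 6y = 6x + 1 (assumed right-hand side)."""
    return A * math.exp(2 * x) + B * math.exp(-3 * x) - x - 1.0 / 3.0

def residual(x, A, B, h=1e-4):
    """y'' + y' - 6y - (6x + 1), with derivatives taken by central differences."""
    y = lambda s: general_solution(s, A, B)
    d1 = (y(x + h) - y(x - h)) / (2 * h)
    d2 = (y(x + h) - 2 * y(x) + y(x - h)) / (h * h)
    return d2 + d1 - 6 * y(x) - (6 * x + 1)

if __name__ == "__main__":
    print(auxiliary_roots(1, 1, -6))   # the roots 2 and -3
    print(residual(0.3, 1.5, -2.0))    # close to zero for any A, B
```

Whatever values are chosen for the two arbitrary constants, the residual stays at the level of rounding error, as the general theory predicts.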
We can clearly also apply this method to similar first-order equations.

Example 11

For instance, the equation of Example 8 can be written as

$$\frac{dy}{dx} - y = x$$

and we can immediately obtain the necessary complementary function
and particular integral:

(a) Complementary function

Put $y = e^{mx}$ in the associated equation

$$\frac{dy}{dx} - y = 0.$$

Then
$$m\,e^{mx} - e^{mx} = 0 \quad\Rightarrow\quad m = 1$$

so that the c.f. is $y = Ae^{x}$.

(b) Particular integral

First we try $y = ax + b$. This would be a p.i. provided that

$$\begin{aligned}
a - (ax + b) &= x \\
\Leftarrow\quad -ax + (a - b) &= x \\
\Leftarrow\quad a = -1, \quad a - b &= 0 \\
\Leftarrow\quad a = b &= -1.
\end{aligned}$$

So a p.i. is $y = -x - 1$.

Hence the general solution of the given equation is

$$y = Ae^{x} - x - 1.$$


Example 12

If an alternating potential is applied to a certain electric circuit then (using
suitable units) the current $x$ after time $t$ is such that

$$\frac{d^2x}{dt^2} + 4\,\frac{dx}{dt} + 3x = \sin t.$$

Find the general solution.

(a) Complementary function

The associated equation is

$$\frac{d^2x}{dt^2} + 4\,\frac{dx}{dt} + 3x = 0$$

and $x = e^{mt}$ is a solution provided that

$$\begin{aligned}
m^2 e^{mt} + 4m\,e^{mt} + 3\,e^{mt} &= 0 \\
\Leftrightarrow\quad m^2 + 4m + 3 &= 0 \\
\Leftrightarrow\quad m &= -1 \text{ or } -3.
\end{aligned}$$

So the c.f. is $x = Ae^{-t} + Be^{-3t}$.

(b) Particular integral

Perhaps we would first try to find a function of the type

$$x = a \sin t$$

which satisfies the original equation for all values of $t$.
But this would entail

$$-a \sin t + 4a \cos t + 3a \sin t = \sin t \quad\Rightarrow\quad 2a \sin t + 4a \cos t = \sin t.$$

This identity is clearly impossible, so we now try

$$x = a \sin t + b \cos t.$$

In this case
$$\frac{dx}{dt} = a \cos t - b \sin t$$

and
$$\frac{d^2x}{dt^2} = -a \sin t - b \cos t.$$

So
$$\begin{aligned}
(-a \sin t - b \cos t) + 4(a \cos t - b \sin t) + 3(a \sin t + b \cos t) &= \sin t \\
\Leftarrow\quad (2a - 4b) \sin t + (4a + 2b) \cos t &= \sin t \\
\Leftarrow\quad 2a - 4b = 1, \quad 4a + 2b &= 0 \\
\Leftarrow\quad a = \tfrac{1}{10}, \quad b &= -\tfrac15,
\end{aligned}$$

and we see that

$$x = \tfrac{1}{10} \sin t - \tfrac15 \cos t \quad\text{is a p.i.}$$

Hence a general solution is

$$x = Ae^{-t} + Be^{-3t} + \tfrac{1}{10}(\sin t - 2\cos t).$$

In a particular case, the constants $A$ and $B$ would usually be determined
by the initial conditions, e.g., from one’s knowledge of the current $x$ at
$t = 0$ and how it was changing. But it is noticeable that both terms of the
complementary function $Ae^{-t} + Be^{-3t}$ tend to zero as $t \to \infty$; and this
happens very rapidly, so that in every case the steady-state current after
the initial surge (or other transient effect) is

$$x \approx \tfrac{1}{10}(\sin t - 2\cos t).$$
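The pair of simultaneous equations for $a$ and $b$ in Example 12, and the rapid decay of the transient terms, can be illustrated as follows. This is a sketch only: the helper names and the sample constants A = 2, B = -1 are assumptions made for the illustration.

```python
from fractions import Fraction
import math

def solve_2x2(a11, a12, b1, a21, a22, b2):
    """Solve a11*x + a12*y = b1, a21*x + a22*y = b2 exactly by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("equations are not independent")
    x = Fraction(b1 * a22 - a12 * b2, det)
    y = Fraction(a11 * b2 - b1 * a21, det)
    return x, y

def steady_state(t):
    """Particular integral x = (sin t - 2 cos t) / 10."""
    return (math.sin(t) - 2 * math.cos(t)) / 10

def current(t, A, B):
    """General solution: transient c.f. plus the steady-state p.i."""
    return A * math.exp(-t) + B * math.exp(-3 * t) + steady_state(t)

if __name__ == "__main__":
    # 2a - 4b = 1 and 4a + 2b = 0
    print(solve_2x2(2, -4, 1, 4, 2, 0))
    for t in (0.0, 5.0, 10.0, 20.0):
        print(t, current(t, 2.0, -1.0) - steady_state(t))
```

The differences printed in the loop are the transient terms alone, and they shrink towards zero as $t$ increases.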


Exercise 9.8d 

1 Find the general solutions of: 

(1) “ - 4: + 3y = 0; 

Gi) $4 — 44 ον = 0; Git) TY — ty = : 

(in) £4 Y= 0; (v) TY 4h 4 ty = 0. 
2 Verify that the given expressions are particular integrals of the corresponding equations, and hence find their general solutions:


d*y y 
— =~ j= = 6 = 3 
ὩΣ 
we ἀν κἂν 
(21) qa ode + YF y = (2x? + 6x + 7). 
sae ee dx 
(172 ) "τ + 3— +2x=¢: x=<eé 
(τυ) αἰ + 30 + 2x =sint; x = 7o(sint — 3 cos ἢ). 
dt? dt me 


3 For each of the following equations find the complementary function,
a particular integral and the general solution:


(1) “ἢ ἡ _ ay τς 
(22 ) “ + get — 3y=e*; 
(222 ) a - 3: + 2x = sin ; 
(iv) i + 35: + 20 = cost. 


4 Find the solutions of the following equations which satisfy the given 
initial conditions: 


. d*y dy dy 

(1) qo aed y= 3 and rae when x = 0; 
d? d 

(ἡ) Gz = 4 x=0 and τ = 4 when ¢ = 0: 
d*@ dé dé 

na Fa ee θ-- ὁ and Ὁ when ¢ = 0. 


5 The needle of a certain heavily damped instrument makes an angle $\theta$
with its zero position after time $t$, and it can be shown that

$$\frac{d^2\theta}{dt^2} + 5\,\frac{d\theta}{dt} + 6\theta = 0.$$

Find an expression for $\theta$ in terms of $t$ if initially (i.e., when $t = 0$) the
needle was at the zero position but moving at 2 rad s$^{-1}$. Hence find the
maximum deflection of the needle and sketch the graph of $\theta$ against $t$.
6 [Figure: a simple series circuit with applied potential $E$.]

If a potential $E$ is applied to a simple circuit with inductance $L$, resistance $R$
and capacitance $C$, it can be shown that the charge $q$ on the capacitor after
time $t$ must satisfy the equation

$$L\,\frac{d^2q}{dt^2} + R\,\frac{dq}{dt} + \frac{q}{C} = E.$$


Find the general solution of this equation for the circuit in which 
L=5 x 10°° (henry), R = 6 (ohm), C = 10 ° (farad) when the applied 
potential 15 100 (volts), and show that the charge on the capacitor very soon 
becomes τὸ C. 


7 In Exercise 9.8c no. 5 (i) and (ii), find the corresponding complementary functions and appropriate particular integrals.
Hence proceed to parts (b) and (c).


9.9 Hyperbolic functions

Earlier in this chapter we introduced the two functions $e^{x}$ and $e^{-x}$ and we
now proceed to consider their semi-difference and semi-sum. For reasons
which will soon become clear, these are called sinh x and cosh x, the
hyperbolic sine and hyperbolic cosine (or, more familiarly, ‘shine’ and ‘cosh’).

So
$$\sinh x = \tfrac12\left(e^{x} - e^{-x}\right), \qquad \cosh x = \tfrac12\left(e^{x} + e^{-x}\right).$$


Firstly, we see that

$$\sinh(-x) = \tfrac12\left(e^{-x} - e^{x}\right) = -\sinh x$$

and
$$\cosh(-x) = \tfrac12\left(e^{-x} + e^{x}\right) = \cosh x;$$

$\Rightarrow$ sinh x is an odd function

and cosh x is an even function.

This is also clear from considering the graphs of $e^{x}$ and $e^{-x}$, and of
sinh x and cosh x:

[Figure: graphs of $e^{x}$, $e^{-x}$, sinh x and cosh x.]


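The first similarity with the trigonometric functions is noticed if we find
the derivatives of sinh x and cosh x.

For
$$\frac{d}{dx}(\sinh x) = \frac{d}{dx}\left\{\tfrac12\left(e^{x} - e^{-x}\right)\right\} = \tfrac12\left(e^{x} + e^{-x}\right) = \cosh x$$

and
$$\frac{d}{dx}(\cosh x) = \frac{d}{dx}\left\{\tfrac12\left(e^{x} + e^{-x}\right)\right\} = \tfrac12\left(e^{x} - e^{-x}\right) = \sinh x.$$

Further, we notice that

$$\cosh^2 x = \tfrac14\left(e^{x} + e^{-x}\right)^2 = \tfrac14\left(e^{2x} + 2 + e^{-2x}\right)$$

$$\sinh^2 x = \tfrac14\left(e^{x} - e^{-x}\right)^2 = \tfrac14\left(e^{2x} - 2 + e^{-2x}\right).$$

So
$$\cosh^2 x - \sinh^2 x = 1$$

and
$$\cosh^2 x + \sinh^2 x = \tfrac12\left(e^{2x} + e^{-2x}\right) = \cosh 2x.$$

Also
$$\begin{aligned}
\sinh 2x &= \tfrac12\left(e^{2x} - e^{-2x}\right) \\
&= \tfrac12\left\{(e^{x})^2 - (e^{-x})^2\right\} \\
&= \tfrac12\left(e^{x} - e^{-x}\right)\left(e^{x} + e^{-x}\right) \\
&= 2 \times \tfrac12\left(e^{x} - e^{-x}\right) \times \tfrac12\left(e^{x} + e^{-x}\right)
\end{aligned}$$

$$\Rightarrow\quad \sinh 2x = 2 \sinh x \cosh x.$$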


Summarising these elementary properties, we see close similarities, as 
well as subtle differences, between the hyperbolic and the trigonometric 
functions, the only property of the latter without parallel being that of 
periodicity : 
$$\frac{d}{dx}(\sinh x) = \cosh x \qquad \frac{d}{dx}(\cosh x) = \sinh x$$

$$\cosh^2 x - \sinh^2 x = 1$$

$$\sinh 2x = 2 \sinh x \cosh x$$

$$\cosh 2x = \cosh^2 x + \sinh^2 x$$

Lastly, and again by analogy with the trigonometric functions, we
define

$$\tanh x = \frac{\sinh x}{\cosh x} \qquad \coth x = \frac{\cosh x}{\sinh x}$$

$$\operatorname{sech} x = \frac{1}{\cosh x} \qquad \operatorname{cosech} x = \frac{1}{\sinh x}.$$
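These identities are easy to test numerically. The sketch below builds sinh and cosh directly from the exponential function, as semi-difference and semi-sum, and evaluates the three identities at a sample point; the function names are illustrative.

```python
import math

def sinh(x):
    """Semi-difference of e^x and e^-x."""
    return 0.5 * (math.exp(x) - math.exp(-x))

def cosh(x):
    """Semi-sum of e^x and e^-x."""
    return 0.5 * (math.exp(x) + math.exp(-x))

def identity_residuals(x):
    """Left side minus right side for each identity; all should be near zero."""
    s, c = sinh(x), cosh(x)
    return {
        "cosh^2 - sinh^2 = 1": c * c - s * s - 1.0,
        "sinh 2x = 2 sinh x cosh x": sinh(2 * x) - 2 * s * c,
        "cosh 2x = cosh^2 + sinh^2": cosh(2 * x) - (c * c + s * s),
    }

if __name__ == "__main__":
    for name, value in identity_residuals(1.3).items():
        print(name, value)
    # Odd and even behaviour
    print(sinh(-0.7) + sinh(0.7), cosh(-0.7) - cosh(0.7))
```

Each residual is zero apart from rounding error, and the last line confirms that sinh is odd and cosh is even.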


Exercise 9.9a

1 Show that the hyperbola

$$\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1$$

can be represented parametrically by the hyperbolic functions

$$x = a \cosh \theta, \quad y = b \sinh \theta.$$

Find $\dfrac{dy}{dx}$ in terms of $\theta$.


2 Show that: 

(1) tanh x = 

(ii) tanh x is an odd function;

(iii) $\dfrac{d}{dx}(\tanh x) = \operatorname{sech}^2 x$;

and sketch the graph of tanh x.


3 Differentiate with respect to $x$:
(i) sinh 2x; (ii) cosh 5x; (iii) x cosh x; (iv) ln sinh x;
(v) ln cosh x; (vi) sech x; (vii) cosech x; (viii) ln tanh ½x.

4 Find the following integrals:

(i) $\int \sinh x\,dx$; (ii) $\int \cosh 2x\,dx$; (iii) $\int \operatorname{sech}^2 x\,dx$;
(iv) $\int \tanh x\,dx$; (v) $\int x \cosh x\,dx$; (vi) $\int x \operatorname{sech}^2 x\,dx$.


5 Calculate:
(i) the area beneath the curve $y = \cosh x$ between $x = -1$ and $x = +1$;
(ii) the volume formed when this area is rotated about the x-axis.

6 Use Maclaurin series to find the power series for:
(i) sinh x; (ii) cosh x.


7 A bead is at rest on a straight smooth horizontal wire which is then
made to rotate with constant angular velocity $\omega$ about one of its ends. It can
be shown that its distance $r$ from the point of rotation will satisfy the
equation

$$\frac{d^2r}{dt^2} - \omega^2 r = 0.$$

Show:

(i) that the general solution of this equation is $r = Ae^{\omega t} + Be^{-\omega t}$;

(ii) that if initially $r = a$, then $r = a \cosh \omega t$;

(iii) that if the wire has rotated through an angle $\theta$, then $\theta = \omega t$ and the
polar equation of the bead’s path is $r = a \cosh \theta$.


Inverse hyperbolic functions 


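We see from a graph that the function sinh x sets up a 1–1 mapping of the
real numbers R on to R, so that its inverse function is defined throughout R
and we can write:

$$y = \sinh x \quad\Leftrightarrow\quad x = \sinh^{-1} y.$$

Now
$$\begin{aligned}
y = \sinh x &= \tfrac12\left(e^{x} - e^{-x}\right) \\
\Rightarrow\quad e^{2x} - 2y\,e^{x} - 1 &= 0 \\
\Rightarrow\quad e^{x} &= y + \sqrt{y^2 + 1} \quad (\text{since } e^{x} > 0) \\
\Rightarrow\quad x &= \ln\left[y + \sqrt{y^2 + 1}\right].
\end{aligned}$$

So
$$\sinh^{-1} y = \ln\left[y + \sqrt{y^2 + 1}\right];$$

and just as sinh is related to the exponential function, so $\sinh^{-1}$ is related
to the logarithmic function.

The function cosh x is slightly more awkward, for it maps all $x \in R$ on
to the range $y \ge 1$, and each of the numbers of the range (except $y = 1$)
arises from two equal and opposite members of the domain.

If, however, we restrict the domain to $x \ge 0$, we can obtain an inverse
function, and write

$$y = \cosh x \;\; (x \ge 0) \quad\Leftrightarrow\quad x = \cosh^{-1} y \;\; (y \ge 1).$$

Now
$$\begin{aligned}
y = \cosh x &= \tfrac12\left(e^{x} + e^{-x}\right) \\
\Rightarrow\quad e^{2x} - 2y\,e^{x} + 1 &= 0 \\
\Rightarrow\quad e^{x} &= y \pm \sqrt{y^2 - 1} \\
\Rightarrow\quad x &= \ln\left[y \pm \sqrt{y^2 - 1}\right].
\end{aligned}$$

But
$$y - \sqrt{y^2 - 1} = \frac{1}{y + \sqrt{y^2 - 1}}$$

$$\Rightarrow\quad \ln\left[y - \sqrt{y^2 - 1}\right] = -\ln\left[y + \sqrt{y^2 - 1}\right].$$

Hence $x = \pm\ln\left[y + \sqrt{y^2 - 1}\right]$.

But $x \ge 0$, so $x = +\ln\left[y + \sqrt{y^2 - 1}\right]$

$$\Leftrightarrow\quad \cosh^{-1} y = \ln\left[y + \sqrt{y^2 - 1}\right].$$

We can now write these inverse hyperbolic functions as

$$\sinh^{-1} x = \ln\left[x + \sqrt{x^2 + 1}\right]$$

$$\cosh^{-1} x = \ln\left[x + \sqrt{x^2 - 1}\right]$$

Their main importance is in the calculation of two standard integrals:

$$\text{(i)}\;\int \frac{dx}{\sqrt{x^2 + 1}} \qquad \text{(ii)}\;\int \frac{dx}{\sqrt{x^2 - 1}}$$

(i) For $\displaystyle\int \frac{dx}{\sqrt{x^2 + 1}}$, put $x = \sinh u \;\Rightarrow\; dx = \cosh u\,du$.

Then
$$\begin{aligned}
\int \frac{dx}{\sqrt{x^2 + 1}} &= \int \frac{\cosh u\,du}{\sqrt{\sinh^2 u + 1}} \\
&= \int \frac{\cosh u\,du}{\cosh u} \\
&= \int du \\
&= u + c \\
&= \sinh^{-1} x + c.
\end{aligned}$$

(ii) For $\displaystyle\int \frac{dx}{\sqrt{x^2 - 1}}$, put $x = \cosh u \;\Rightarrow\; dx = \sinh u\,du$.

Then
$$\begin{aligned}
\int \frac{dx}{\sqrt{x^2 - 1}} &= \int \frac{\sinh u\,du}{\sqrt{\cosh^2 u - 1}} \\
&= \int \frac{\sinh u\,du}{\sinh u} \\
&= \int du \\
&= u + c \\
&= \cosh^{-1} x + c.
\end{aligned}$$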
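The logarithmic forms of the inverse hyperbolic functions can be compared with the library versions. The sketch below uses Python's standard `math.asinh` and `math.acosh`; the names `asinh_log` and `acosh_log` are illustrative.

```python
import math

def asinh_log(y):
    """sinh^{-1} y = ln[y + sqrt(y^2 + 1)], defined for every real y."""
    return math.log(y + math.sqrt(y * y + 1.0))

def acosh_log(y):
    """cosh^{-1} y = ln[y + sqrt(y^2 - 1)], defined for y >= 1."""
    if y < 1.0:
        raise ValueError("cosh^{-1} y requires y >= 1")
    return math.log(y + math.sqrt(y * y - 1.0))

if __name__ == "__main__":
    for y in (-2.0, 0.0, 0.75, 3.0):
        print(y, asinh_log(y), math.asinh(y))
    for y in (1.0, 2.0, 10.0):
        print(y, acosh_log(y), math.acosh(y))
```

In each row the logarithmic formula and the library function agree, and `acosh_log` always returns a non-negative value, matching the restriction of the domain of cosh to $x \ge 0$.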
Exercise 9.9b

1 Express as logarithms:
(i) $\sinh^{-1} 3$; (ii) $\cosh^{-1} 2$; (iii) $\tanh^{-1} x$.


2 Find the following integrals: 
dx és dx = dx 
(2) ( oe (22 ) | arp (221 ) | sates: 
dx dx dx 
(10) ΕΣ (υ) ΕΞ (υ1) |=: 
(vat ) | sinh’ x dx; (v221 ) | cosh’ « dx; 


(ix) | vee + 1)dx; (x) | ver — 4) dx. 


3 Evaluate:


Ξ dx ΤῊΝ ἢ: αχ 
(2) : Ve +1)’ (22 ) » J(x2 — 4)’ 
(111) (x? -- 1) dx; (1v) sat + 4x*) dx 


4 (i) Find $\displaystyle\int \frac{dx}{1 - x^2}$ by two methods: the use of partial fractions, and
the substitution $x = \tanh u$ [see also no. 1 (iii)].
(ii) If a particle falls from rest through a medium whose resistance is
proportional to the square of the velocity, it can be shown (in the usual
notation) that

$$\frac{dv}{dt} = v\,\frac{dv}{dx} = g - kv^2.$$

Hence find $v$ in terms of $x$; $v$ in terms of $t$; and the limiting value of the
velocity.
9.10 Lengths of curves and areas of surfaces


We have already used the method of integration to find both plane areas 
bounded by curves and also volumes bounded by surfaces of revolution, 
and we now complete our account by a short investigation into the lengths 
of such curves and the areas of such surfaces. A refined treatment of these 
topics is difficult as it is necessary to seek protection against the curve 
being too crinkly or the surface being too corrugated, so we shall restrict 
ourselves to an introductory look at smooth curves and at the surfaces 
formed by their revolution. 


The length of a curve 


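[Figure: a curve with fixed point A, a point P(x, y) at arc-length s from A, and a neighbouring point P′.]

Suppose that a fixed point A is taken on the given curve and that the arc-length AP from A to a point $P(x, y)$ of the curve is $s$.

Let us further suppose that $P'(x + \delta x,\, y + \delta y)$ is a neighbouring point
of the curve and that $PP' = \delta s$.

Then
$$(\delta s)^2 \approx (\delta x)^2 + (\delta y)^2$$

$$\Rightarrow\quad \left(\frac{\delta s}{\delta x}\right)^2 \approx 1 + \left(\frac{\delta y}{\delta x}\right)^2.$$

So it appears (and, provided the curve is smooth, this can be fully justified)
that:

$$\left(\frac{ds}{dx}\right)^2 = 1 + \left(\frac{dy}{dx}\right)^2 \quad\Rightarrow\quad s = \int \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx.$$

Alternatively, if $x$ and $y$ are known in terms of a parameter $t$,

$$\left(\frac{\delta s}{\delta t}\right)^2 \approx \left(\frac{\delta x}{\delta t}\right)^2 + \left(\frac{\delta y}{\delta t}\right)^2$$

$$\Rightarrow\quad \left(\frac{ds}{dt}\right)^2 = \left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2$$

$$\Rightarrow\quad s = \int \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\,dt.$$

Hence

$$s = \int \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx = \int \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\,dt.$$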


Example 1

Find an expression in terms of $x$ for the length $s$ of the curve $y = c \cosh (x/c)$,
measured from its lowest point. (It can be shown that this curve, known
as a catenary, is the curve in which a uniform flexible chain hangs when
suspended between two points. The constant $c = T_0/w$, where $T_0$ is the
tension at its lowest point and $w$ is its weight per unit length.)

$$y = c \cosh \frac{x}{c} \quad\Rightarrow\quad \frac{dy}{dx} = \sinh \frac{x}{c}$$

$$\begin{aligned}
\Rightarrow\quad s &= \int_0^x \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx \\
&= \int_0^x \sqrt{1 + \sinh^2 \frac{x}{c}}\,dx \\
&= \int_0^x \cosh \frac{x}{c}\,dx \\
&= \left[c \sinh \frac{x}{c}\right]_0^x
\end{aligned}$$

$$\Rightarrow\quad s = c \sinh \frac{x}{c}.$$

(Hence, further, we see that

$$y^2 - s^2 = c^2 \cosh^2 \frac{x}{c} - c^2 \sinh^2 \frac{x}{c} = c^2$$

$$\Rightarrow\quad y^2 = s^2 + c^2.)$$
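The arc-length integral for the catenary can also be evaluated numerically and compared with $c \sinh (x/c)$. The sketch below uses Simpson's rule; the choice of rule, the sample values c = 2 and x = 3, and the function names are illustrative assumptions.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with an even number n of strips."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def catenary_length_numeric(c, x):
    """Integrate sqrt(1 + (dy/dx)^2) from 0 to x, where dy/dx = sinh(u/c)."""
    slope = lambda u: math.sinh(u / c)
    return simpson(lambda u: math.sqrt(1.0 + slope(u) ** 2), 0.0, x)

def catenary_length_formula(c, x):
    """Closed form s = c sinh(x/c)."""
    return c * math.sinh(x / c)

if __name__ == "__main__":
    c, x = 2.0, 3.0
    s = catenary_length_formula(c, x)
    y = c * math.cosh(x / c)
    print(catenary_length_numeric(c, x), s)
    print(y * y - s * s, c * c)   # y^2 - s^2 = c^2
```

The numerical and closed-form lengths agree closely, and the second line illustrates the relation $y^2 = s^2 + c^2$.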


Example 2 


A wheel of radius a is rolling in a straight line along a horizontal road. Find 
the total distance travelled by a point on its circumference between succes- 
sive contacts with the road. 


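Suppose that the point P was initially in contact with the road at O, but
that the wheel has since rolled through an angle $\theta$. If M is its new point of
contact, since the wheel has been rolling,

$$OM = \text{arc } PM = a\theta.$$

Hence $x = a\theta - a \sin \theta$

and $y = a - a \cos \theta$.

So the curve traced by P (known as a cycloid) can be expressed parametrically as

$$x = a(\theta - \sin \theta), \quad y = a(1 - \cos \theta),$$

and we need to find the length of this curve between the points $\theta = 0$ and
$\theta = 2\pi$.

Now
$$\frac{dx}{d\theta} = a(1 - \cos \theta), \quad \frac{dy}{d\theta} = a \sin \theta.$$

So
$$\begin{aligned}
s &= \int_0^{2\pi} \sqrt{\left(\frac{dx}{d\theta}\right)^2 + \left(\frac{dy}{d\theta}\right)^2}\,d\theta \\
&= a \int_0^{2\pi} \sqrt{(1 - \cos \theta)^2 + \sin^2 \theta}\,d\theta \\
&= a \int_0^{2\pi} \sqrt{2 - 2\cos \theta}\,d\theta \\
&= a \int_0^{2\pi} \sqrt{4 \sin^2 \tfrac12\theta}\,d\theta \\
&= 2a \int_0^{2\pi} \sin \tfrac12\theta\,d\theta \\
&= 2a \left[-2\cos \tfrac12\theta\right]_0^{2\pi} \\
&= 2a\left\{-2(-1) + 2(1)\right\}
\end{aligned}$$

$$\Rightarrow\quad s = 8a.$$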
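The parametric form of the arc-length integral lends itself to the same kind of numerical check. The sketch below integrates the speed along the cycloid over one arch by Simpson's rule; the routine and the sample radius a = 1.5 are illustrative assumptions.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with an even number n of strips."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def cycloid_arch_length(a, n=1000):
    """Length of x = a(th - sin th), y = a(1 - cos th) for th from 0 to 2*pi."""
    dx = lambda th: a * (1.0 - math.cos(th))
    dy = lambda th: a * math.sin(th)
    # sqrt((dx/dth)^2 + (dy/dth)^2) integrated over one revolution of the wheel
    return simpson(lambda th: math.hypot(dx(th), dy(th)), 0.0, 2.0 * math.pi, n)

if __name__ == "__main__":
    a = 1.5
    print(cycloid_arch_length(a), 8 * a)
```

The numerical length is very close to $8a$, so the point on the rim travels four wheel-diameters between successive contacts with the road.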


Exercise 9.10a 


1 Sketch the following curves and find their lengths between the given 
points: 
(i) $y = x^{3/2}$, from (0, 0) to (4, 8);


(1) y=Insecx, fromx = Οἴο ἐπ; 
(111) x = Qat?, y = 3at?, fromt = Otot = 1; 
(iv) $x = at^2$, $y = 2at$, from $t = 0$ to $t = 1$.


2 Sketch the astroid

$$x = a \cos^3 t, \quad y = a \sin^3 t$$

and find its total length.


The area of a surface of revolution 


We begin by investigating the simplest case, of a conical surface being
formed by the revolution of a straight line. Now if a complete cone of
base-radius $r$ and slant-length $l$ is cut along a generator, it can be
unwrapped to form a circular sector of radius $l$ and arc $2\pi r$.

So the angle of the sector is $2\pi r/l$, and its area is

\[ \tfrac12 l^2 \cdot \frac{2\pi r}{l} = \pi r l. \]

We now proceed to consider the section of a conical surface whose
end-faces have radii $R$, $r$, and whose slant-length is $l$:

[Figure: section of a cone with end-radii $R$, $r$ and slant-length $l$.]

If the semi-vertical angle of the cone is $\alpha$, then the area of its curved surface

\[
\begin{aligned}
&= \pi R (R \operatorname{cosec} \alpha) - \pi r (r \operatorname{cosec} \alpha) \\
&= \pi (R + r)(R - r) \operatorname{cosec} \alpha \\
&= \pi (R + r) l \\
&= 2\pi \left( \frac{R + r}{2} \right) l.
\end{aligned}
\]

Hence Area of curved surface of section
= average circumference × slant length.


We can now move to the case of a surface formed by revolving a curve 
about Ox: 


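[Figure: an arc of the curve, with an element $\delta s$ at height $y$, revolved about Ox.]

If $S$ denotes the area of surface generated, we see from consideration of a
small element that

\[ \delta S \approx 2\pi \left( y + \tfrac12 \delta y \right) \delta s \approx 2\pi y\, \delta s \]

\[ \Rightarrow\quad \frac{dS}{ds} = 2\pi y \quad\Rightarrow\quad S = \int 2\pi y\, ds. \]

Example 3


Find the area of the cap cut off by the plane $x = a$ when the parabola
$y^2 = 4ax$ is rotated about Ox.

\[ y^2 = 4ax \quad\Rightarrow\quad y = \pm 2\sqrt{ax} \quad\Rightarrow\quad \frac{dy}{dx} = \pm\sqrt{\frac{a}{x}}. \]

Hence
\[
\begin{aligned}
\text{area} &= \int 2\pi y\, ds \\
&= \int_0^a 2\pi y \sqrt{1 + \left( \frac{dy}{dx} \right)^2}\, dx \\
&= \int_0^a 4\pi \sqrt{ax}\, \sqrt{1 + \frac{a}{x}}\, dx \\
&= 4\pi \sqrt{a} \int_0^a \sqrt{x + a}\, dx \\
&= 4\pi \sqrt{a} \left[ \tfrac23 (x + a)^{3/2} \right]_0^a \\
&= 4\pi \sqrt{a} \cdot \tfrac23 \left[ (2a)^{3/2} - a^{3/2} \right]
\end{aligned}
\]

\[ \Rightarrow\quad \text{Required area} = \frac{8(2\sqrt2 - 1)}{3}\, \pi a^2. \]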


Example 4 


Investigate the surface area of the portion of a sphere between two parallel
planes.

Suppose that we need to find the area between two circles whose
declinations from Ox are $\alpha$, $\beta$ respectively.

\[
\begin{aligned}
\text{Required area} &= \int 2\pi y\, ds \\
&= \int_\alpha^\beta 2\pi a \sin\theta \times a\, d\theta \\
&= 2\pi a^2 \int_\alpha^\beta \sin\theta\, d\theta \\
&= 2\pi a^2 \left[ -\cos\theta \right]_\alpha^\beta \\
&= 2\pi a^2 (\cos\alpha - \cos\beta).
\end{aligned}
\]

Now if $t$ is the thickness of this portion, $t = a(\cos\alpha - \cos\beta)$

\[ \Rightarrow\quad \text{Required area} = 2\pi a t, \]

and so is dependent only upon the distance between the parallel planes and
not upon their position.


We can also consider the circumscribing cylinder and immediately see
that the corresponding area on the cylinder is also $2\pi a t$, so we obtain
Archimedes’ theorem† that the area of a ‘zone’ on a sphere is equal to the
corresponding zone on the circumscribing cylinder (or, in other words,
that if a spherical hard-boiled egg were put through an egg-slicer whose
wires were all equidistant, each portion would have the same area of
curved surface!).
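The egg-slicer remark can be tested numerically. The following sketch is an illustration under stated assumptions: the names `zone_area` and `thickness` and the midpoint-rule integration are chosen here and do not come from the text. It evaluates $\int_\alpha^\beta 2\pi (a\sin\theta)\, a\, d\theta$ for two zones of equal thickness at different positions on the sphere.

```python
import math


def zone_area(a, alpha, beta, steps=100_000):
    """Midpoint-rule value of the integral of 2*pi*(a sin t)*a dt from alpha to beta."""
    h = (beta - alpha) / steps
    total = 0.0
    for k in range(steps):
        t = alpha + (k + 0.5) * h
        total += 2 * math.pi * (a * math.sin(t)) * a * h
    return total


def thickness(a, alpha, beta):
    """Distance between the two parallel planes: t = a(cos alpha - cos beta)."""
    return a * (math.cos(alpha) - math.cos(beta))


if __name__ == "__main__":
    a = 2.0
    # Two zones of the same thickness (0.3a), one near the pole, one near the equator.
    near_pole = (math.acos(0.8), math.acos(0.5))
    near_equator = (math.acos(0.2), math.acos(-0.1))
    for alpha, beta in (near_pole, near_equator):
        t = thickness(a, alpha, beta)
        print(zone_area(a, alpha, beta), 2 * math.pi * a * t)
```

Both zones should give the same area, matching $2\pi a t$ in each case.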


Exercise 9.10b 


1 Find the areas of the surfaces of revolution formed by rotating
(i) the catenary $y = \cosh (x/c)$ about the x-axis, from $x = 0$ to $x = c$;
(ii) the parabola $y^2 = 4ax$ about the x-axis, from $x = 0$ to $x = a$;
(iii) the astroid $x = a\cos^3\theta$, $y = a\sin^3\theta$ about Ox;
(iv) the cycloid $x = a(\theta - \sin\theta)$, $y = a(1 - \cos\theta)$ about Ox, from
$\theta = 0$ to $2\pi$.

† Archimedes of Syracuse (287–212 BC), who requested that this diagram be inscribed on
the splendid tomb which was erected for him by the Romans.


9.11 Areas and lengths in polar coordinates 


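If we are given the equation of a curve in Cartesian coordinates, we have
seen how the inclination $\psi$ of its tangent to Ox, the area $A$ beneath the
curve and its length $s$ can all be derived from an examination of the diagrams

[Figures: elements of area, gradient and arc length in Cartesian coordinates.]

\[ \delta A \approx y\, \delta x; \qquad \tan\psi \approx \frac{\delta y}{\delta x}; \qquad (\delta s)^2 \approx (\delta x)^2 + (\delta y)^2 \]

\[ \Rightarrow\quad A = \int y\, dx; \qquad \tan\psi = \frac{dy}{dx}; \qquad s = \int \sqrt{1 + \left( \frac{dy}{dx} \right)^2}\, dx. \]

Now in polar coordinates, instead of the inclination $\psi$ which the tangent
makes with Ox, it is usual to consider its inclination $\phi$ to the radius (or, if
preferred, its angle of trail):

[Figure: the angle $\phi$ between the tangent and the radius.]

So, just as in Cartesian coordinates, we can find formulae for $A$, $\phi$, and $s$
by considering the figures formed as $P(r, \theta)$ moves along the curve to its
neighbouring position $P'(r + \delta r, \theta + \delta\theta)$:

[Figures: the sector $OPP'$ and the elemental triangle $P'PN$.]

From the second figure, it is clear that
\[ \delta A \approx \triangle OPP' = \tfrac12 r (r + \delta r) \sin \delta\theta \quad\Rightarrow\quad \delta A \approx \tfrac12 r^2\, \delta\theta. \]
Furthermore, since PN has been drawn perpendicular to $OP'$ it follows
that
\[ ON = r \cos \delta\theta \approx r \quad\text{and}\quad PN = r \sin \delta\theta \approx r\, \delta\theta. \]
So in the elemental $\triangle P'PN$,
\[ PN \approx r\, \delta\theta, \qquad P'N \approx \delta r, \qquad \angle NP'P \approx \phi. \]
Hence
\[ \tan\phi \approx r\, \frac{\delta\theta}{\delta r} \quad\text{and}\quad (\delta s)^2 \approx (\delta r)^2 + (r\, \delta\theta)^2. \]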


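Summarising these three, we see that

\[ \delta A \approx \tfrac12 r^2\, \delta\theta; \qquad \tan\phi \approx r\, \frac{\delta\theta}{\delta r}; \qquad \delta s \approx \sqrt{(\delta r)^2 + (r\, \delta\theta)^2} \]

\[ \Rightarrow\quad A = \int \tfrac12 r^2\, d\theta; \qquad \tan\phi = r\, \frac{d\theta}{dr}; \qquad s = \int \sqrt{r^2 + \left( \frac{dr}{d\theta} \right)^2}\, d\theta. \]

Example 1


If $r = e^{\theta \cot\alpha}$ (where $\alpha$ is a constant), find $A$, $\phi$, and $s$ in terms of $r$ (measuring
$A$ and $s$ from the point where $\theta = 0$).

(i)
\[
\begin{aligned}
A &= \int_0^\theta \tfrac12 r^2\, d\theta \\
&= \int_0^\theta \tfrac12 e^{2\theta \cot\alpha}\, d\theta \\
&= \left[ \frac{e^{2\theta \cot\alpha}}{4 \cot\alpha} \right]_0^\theta \\
&= \tfrac14 \tan\alpha\, (e^{2\theta \cot\alpha} - 1) \\
&= \tfrac14 (r^2 - 1) \tan\alpha.
\end{aligned}
\]

(ii)
\[ \tan\phi = r\, \frac{d\theta}{dr} = \frac{r}{dr/d\theta} = \frac{e^{\theta \cot\alpha}}{\cot\alpha\; e^{\theta \cot\alpha}} = \tan\alpha. \]

Hence $\phi$ ($= \alpha$), the angle of trail, is constant, so that the curve is known as an
equiangular spiral.

(iii)
\[
\begin{aligned}
s &= \int_0^\theta \sqrt{r^2 + \left( \frac{dr}{d\theta} \right)^2}\, d\theta \\
&= \int_0^\theta \sqrt{e^{2\theta \cot\alpha} + \cot^2\alpha\; e^{2\theta \cot\alpha}}\, d\theta \\
&= \operatorname{cosec}\alpha \int_0^\theta e^{\theta \cot\alpha}\, d\theta \\
&= \operatorname{cosec}\alpha \left[ \frac{e^{\theta \cot\alpha}}{\cot\alpha} \right]_0^\theta \\
&= \sec\alpha\, (e^{\theta \cot\alpha} - 1) \\
&= (r - 1) \sec\alpha.
\end{aligned}
\]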
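As a check on the example, the polar formulae $A = \int \tfrac12 r^2\, d\theta$ and $s = \int \sqrt{r^2 + (dr/d\theta)^2}\, d\theta$ can be evaluated numerically for the spiral $r = e^{\theta\cot\alpha}$ and compared with the closed forms $\tfrac14 \tan\alpha\,(e^{2\theta\cot\alpha} - 1)$ and $\sec\alpha\,(e^{\theta\cot\alpha} - 1)$. The function name `spiral_area_and_length` and the midpoint rule are assumptions made for this sketch.

```python
import math


def spiral_area_and_length(alpha, theta, steps=100_000):
    """Numerically integrate area and arc length of r = exp(t * cot(alpha)), 0 <= t <= theta."""
    k = 1 / math.tan(alpha)           # cot(alpha)
    h = theta / steps
    area = 0.0
    length = 0.0
    for j in range(steps):
        t = (j + 0.5) * h             # midpoint of the j-th strip
        r = math.exp(k * t)
        dr = k * r                    # dr/dt
        area += 0.5 * r * r * h
        length += math.hypot(r, dr) * h
    return area, length


if __name__ == "__main__":
    alpha, theta = math.pi / 3, 2.0
    r = math.exp(theta / math.tan(alpha))
    area, length = spiral_area_and_length(alpha, theta)
    print(area, 0.25 * (r * r - 1) * math.tan(alpha))
    print(length, (r - 1) / math.cos(alpha))
```

Each printed pair should agree to several decimal places.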
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Exercise 9.11 


1 Sketch each of the following curves, finding also their total lengths,
their total areas, and $\phi$ in terms of $\theta$.

(i) $r = a\cos\theta$;

(ii) $r = a(1 + \cos\theta)$, a cardioid.


2 Sketch the lemniscate $r^2 = a^2 \sin 2\theta$. Find its total area and also $\phi$ in
terms of $\theta$.


3 Sketch the trefoil r> = a* sin 30 and find the area of each petal. 


4 Sketch the Archimedean spiral $r = a\theta$. Find the area of the sector and the
length of arc between the points where $\theta = 0$ and $\theta = 1$.


Miscellaneous problems 9 

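1 (i) Calculate $(1 + 1/n)^n$ for $n = 1, 2, 3, 5, 10, 100$.

Does it appear to tend to a limit as $n \to \infty$?

(ii) If $n = 1/h$, show that

\[ \ln \left( 1 + \frac1n \right)^n = \frac{\ln (1 + h)}{h} = \frac{\ln (1 + h) - \ln 1}{h}. \]

Letting $n \to \infty$ (and so $h \to 0$), show that

\[ \ln \left( 1 + \frac1n \right)^n \to 1, \quad\text{and hence that}\quad \left( 1 + \frac1n \right)^n \to e. \]

(iii) Prove similarly that, as $n \to \infty$,

\[ \left( 1 + \frac{x}{n} \right)^n \to e^x. \]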


2 By the method of two staircases (4.2), show that

\[ \int_1^{10^6} \frac{dx}{x} \quad\text{lies between}\quad \sum_{r=2}^{10^6} \frac1r \quad\text{and}\quad \sum_{r=1}^{10^6 - 1} \frac1r. \]

Hence show that the sum of the reciprocals of the first million integers
lies between 13.81 and 14.82.


3 By considering the graph of $1/x$ or otherwise, show that

\[ \ln n - \ln (n - 1) > \frac1n \quad\text{and}\quad \ln n - \ln (n - 1) < \frac12 \left( \frac1n + \frac{1}{n - 1} \right), \]

where $n$ is an integer greater than 1.
The function $f(n)$ is defined by

\[ f(n) = 1 + \frac12 + \frac13 + \cdots + \frac1n - \ln n. \]

Show that $f(n)$ decreases as $n$ increases and that $f(n) - \dfrac{1}{2n}$ increases as $n$
increases. Deduce that $f(n)$ tends to a finite limit as $n$ tends to infinity.
(C.S.)

4 Show that

\[ \int_1^n \ln x\, dx < \sum_{r=1}^n \ln r < \int_1^{n+1} \ln x\, dx \]

and deduce that the geometric mean of the first $n$ positive integers is
approximately $n/e$. (C.S.)
5 (i) Investigate the behaviour as $x \to \infty$ of the ratios
In x In x In x 
x AE ey x 
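(ii) Show that if $t > 1$,
\[ \frac1t < \frac{1}{\sqrt t} \]
and hence that

\[ \ln x < 2(\sqrt x - 1) \]

\[ \Rightarrow\quad \frac{\ln x}{x} < 2 \left( \frac{1}{\sqrt x} - \frac1x \right). \]

Deduce that as $x \to \infty$,

\[ \frac{\ln x}{x} \to 0. \]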


(111) Show that 


and use (11) to deduce its limit as x — οὐ. 


(iv) More generally, show that as $x \to \infty$, $\ln x \to \infty$ more slowly than
any positive power (however small) of $x$.

6 (i) Investigate the behaviour as $x \to \infty$ of


e~ οἷ ΓᾺ 


=) =e 


x xX Xx 


(ii) Show that

\[ \ln \frac{e^x}{x} = x \left( 1 - \frac{\ln x}{x} \right) \]

and use the result of no. 5 (ii) to show that as $x \to \infty$,

\[ \ln \frac{e^x}{x} \to \infty \quad\text{and hence that}\quad \frac{e^x}{x} \to \infty. \]

(iii) Deduce that as $x \to \infty$,

\[ \frac{e^x}{x^{1000}} \to \infty. \]

(iv) More generally, show that as $x \to +\infty$, $e^x \to +\infty$ faster than any
positive power of $x$; and that $e^{-x} \to 0$ faster than any negative power of $x$.


7 If
\[ I_n(x) = \int_0^x t^n e^{-t}\, dt, \]
show that $I_n(x) = -x^n e^{-x} + n I_{n-1}(x)$.
If $I_n(x) \to I_n$ as $x \to +\infty$, use the result of no. 6 to show that
\[ I_n = n I_{n-1} \]
and hence that, if $n \in \mathbb{Z}^+$,

\[ I_n = n! \]


8 (i) Sketch the graph of the function $e^{-x^2}$ and illustrate the integral

\[ I = \int_{-\infty}^{\infty} e^{-x^2}\, dx. \]

Can you calculate this integral?
(ii) Illustrate by a surface the function of two variables $f(x, y)$, where

\[ f(x, y) = e^{-x^2 - y^2}. \]

(iii) Show that the volume under this surface can be represented in Cartesian
coordinates as

\[ \lim \sum \sum e^{-x^2 - y^2}\, \delta x\, \delta y = \iint e^{-x^2 - y^2}\, dx\, dy \]

and in polar coordinates as

\[ \lim \sum \sum e^{-r^2} (\delta r)(r\, \delta\theta) = \iint r e^{-r^2}\, dr\, d\theta; \]

and hence that

\[ \left( \int_{-\infty}^{\infty} e^{-x^2}\, dx \right) \left( \int_{-\infty}^{\infty} e^{-y^2}\, dy \right) = \int_{r=0}^{\infty} \int_{\theta=0}^{2\pi} r e^{-r^2}\, dr\, d\theta. \]

(iv) Finally show that $\int_0^\infty r e^{-r^2}\, dr = \tfrac12$ and hence calculate $I$.


9 Show that, if

\[ f(x) = \sum_{n=1}^{N} a_n \sin nx, \qquad (1) \]

then

\[ a_n = \frac{2}{\pi} \int_0^\pi f(x) \sin nx\, dx. \qquad (2) \]

Assuming that the function
\[ f(x) = x(\pi - x), \qquad (0 < x < \pi) \]
can be expressed as an infinite series†

\[ \sum_{n=1}^{\infty} a_n \sin nx, \]

and that the coefficients are still given by the formula (2), show that in
this case

\[ a_{2m} = 0, \qquad a_{2m+1} = \frac{8}{\pi (2m + 1)^3}, \]

and hence sum the series

\[ 1 - \frac{1}{3^3} + \frac{1}{5^3} - \cdots \qquad \text{(C.S.)} \]


10 In a population of $N$ rats, $n$ initially have a disease, the rest have not
yet caught it. If a rat recovers from the disease it becomes immune. At a
later time, $t$, the population is therefore composed of $I(t)$ rats ill with the
disease, $D(t)$ dead rats, $R(t)$ rats who have already recovered and $S(t)$ who
are still susceptible.

It is observed that at any stage
\[ \frac{dR}{dt} = aI, \qquad \frac{dD}{dt} = bI, \qquad \frac{dS}{dt} = -cSI, \]
where $a$, $b$, and $c$ are constants, and $N - n > (a + b)/c$.

Prove that $I$ increases until $S$ falls to $(a + b)/c$, and then decreases. Prove
also that the maximum value of $I$ is

\[ N - \frac{a + b}{c} \left[ 1 + \ln \frac{c(N - n)}{a + b} \right]. \qquad \text{(O.S.)} \]


11 Two tribes live on a desert island. Each tribe breeds only among itself
and, in isolation, the population of each would increase at the same constant 
rate. But each tribe is eaten (by the other) at a rate equal to a constant, ὦ, 
times the population of the other tribe. At a certain time the total
population of both tribes together is $N_0$. Investigate how the total population
varies with time subsequently. (O.S.)


12 JAMES: ‘$\pi$ is the most important constant in mathematics.’
JOHN: ‘No, $e$ is.’
Continue the discussion. (C.S.)


† A typical example of a Fourier series, so called after their discoverer, J. B. J. Fourier (1768–1830).


10


Complex numbers 


‘I met a man recently who told me that, so far from believing in the
square root of minus one, he did not even believe in minus one. This is at
any rate a consistent attitude. There are certainly many people who
regard $\sqrt2$ as something perfectly obvious, but jib at $\sqrt{-1}$. This is because
they think that they can visualise the former as something in physical
space, but not the latter. Actually, $\sqrt{-1}$ is a much simpler concept.’

E. C. Titchmarsh, Mathematics for the General Reader. 


10.1 Introduction 


The problem of complex numbers arises immediately in attempts to solve
such quadratic equations as $x^2 + 6x + 13 = 0 \Rightarrow x = -3 \pm \sqrt{9 - 13}$,
which is seemingly impossible since there does not apparently exist a
number whose square is $-4$. Understandably, many people give up at this
point and say in desperation that this quadratic has no solution, meaning
‘has no solution among the reals’. But a pupil who had learned to solve
quadratic equations by factorisation might equally well say that the
equation $x^2 - 2x - 4 = 0$ cannot be solved, on the grounds that he
cannot factorise the left-hand side. In fact, he would find, a little later in his
mathematical education, that $x = 1 + \sqrt5$ and $1 - \sqrt5$ are solutions of
the latter equation, and probably feel quite happy about the existence of
the numbers $1 + \sqrt5$ and $1 - \sqrt5$. When at the earlier stage he had given
up in his attempt to solve $x^2 - 2x - 4 = 0$ because it would not factorise,
what he really meant was that it had no solutions among the rationals, because
he was attempting to find rational factors of $x^2 - 2x - 4$. In fact, of course,
it does ‘factorise’, as $(x - 1 + \sqrt5)(x - 1 - \sqrt5)$, but only with the use
of irrationals.


Now it is true, as Professor Titchmarsh implied in the quotation above,
that $\sqrt5$ is a far more complicated concept than $\sqrt{-1}$, and we have already
shown, in 1.5, that such numbers as $\sqrt5$ are irrational in the sense that they
cannot be expressed as the ratio of integers. Yet most people are happier
with $\sqrt5$ than with $\sqrt{-1}$. Why? Perhaps simply because $\sqrt5$ can be seen,
with a place in the real number line? By contrast, $\sqrt{-1}$ and the roots
of $x^2 + 6x + 13 = 0$ do not appear on the number line, and so it is
tempting to suppose they do not exist.


Now the mathematician’s attitude at this stage is to say ‘It would be a
great pity if we had for ever to suffer the irregularity of having some
quadratic equations with two roots and others with none. How much better
if all quadratics were to have two roots! We shall not let an equation like
$x^2 + 6x + 13 = 0$ beat us! Let us invent a “number” $\sqrt{-1}$, which we
shall call $i$, and suppose that it obeys all the usual laws of algebra (which
will be stated in full in 14.1)’. Upon this bold hypothesis we should then
say that the roots of $x^2 + 6x + 13 = 0$ were $-3 \pm \sqrt{-4} = -3 \pm 2\sqrt{-1}$
$= -3 \pm 2i$. Notice that the word ‘number’ applied to $\sqrt{-1}$ is in inverted
commas, because we must not think of it in the same terms as we have
thought of numbers hitherto, as a length or a measure. Rather is it a number
in the sense that it behaves like ordinary numbers, and for some time we
may use $i$ without being too scrupulous about its meaning. This is what
happened historically: for centuries mathematicians ‘played with $i$’,
treating it as an ‘ordinary’ number and using it to obtain extremely
valuable results. For example,


\[
\begin{aligned}
(a^2 + b^2)(c^2 + d^2) &= (a^2 - i^2 b^2)(c^2 - i^2 d^2) \\
&= (a + ib)(a - ib)(c + id)(c - id) \\
&= (a + ib)(c + id) \times (a - ib)(c - id) \\
&= (ac + i^2 bd + ibc + iad)(ac + i^2 bd - ibc - iad) \\
&= [ac - bd + i(bc + ad)][ac - bd - i(bc + ad)] \\
&= (ac - bd)^2 + (bc + ad)^2.
\end{aligned}
\]
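The identity can be tried on particular integers. The sketch below is only an illustration: the function name `two_square_product` is an assumption introduced here. It takes the real and imaginary parts of $(a + ib)(c + id)$ and confirms that the sum of their squares equals $(a^2 + b^2)(c^2 + d^2)$.

```python
def two_square_product(a, b, c, d):
    """Return (p, q) such that (a^2 + b^2)(c^2 + d^2) == p^2 + q^2.

    p and q are the real and imaginary parts of (a + ib)(c + id).
    """
    p = a * c - b * d
    q = b * c + a * d
    return p, q


if __name__ == "__main__":
    a, b, c, d = 4, 5, 2, 3
    p, q = two_square_product(a, b, c, d)
    print((a * a + b * b) * (c * c + d * d), p * p + q * q)
    # Python's built-in complex type gives the same two parts.
    print(complex(a, b) * complex(c, d), (p, q))
```

Because only integer arithmetic is used, the two totals on the first printed line agree exactly.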


The final result is quite valid, even though the working involved the use
of the ‘invented’ number $i$ (the word ‘imaginary’ is a somewhat
unfortunate choice). And in many other situations, mathematicians have found
that perfectly respectable results can be derived (often more shortly)
by the use of $i$, and this provides strong encouragement for belief in this
extension of the number system resulting from such a bold hypothesis.

A word of warning is necessary about bold hypotheses. Sometimes
mathematicians have to accept defeat. For example, the equation $ax = b$
always has a solution, provided $a \ne 0$. What a pity, exclaims the true
mathematician, that this exception should have to be tolerated; that there
should be just one number 0 which does not possess a reciprocal. Let us
therefore invent a new number, $\infty$, such that $\infty \times 0 = 1 = 0 \times \infty$.

This time, unfortunately, the bold hypothesis must be rejected. For,
starting with $0 \times 1 = 0 \times 2$ and multiplying both sides on the left
by $\infty$, we have

\[
\begin{aligned}
& \infty \times (0 \times 1) = \infty \times (0 \times 2) \\
\Rightarrow\quad & (\infty \times 0) \times 1 = (\infty \times 0) \times 2 && \text{(associativity of multiplication)} \\
\Rightarrow\quad & 1 \times 1 = 1 \times 2 && \text{(since } \infty \times 0 = 1 \text{, by definition)} \\
\Rightarrow\quad & 1 = 2, \text{ a contradiction.}
\end{aligned}
\]

So we have run into trouble, and the assumption, that a reciprocal for 0
can be dreamed up, is untenable.

However, it can be shown that no such inconsistency arises in the case
of the invention of $i$, and that everything proceeds happily. Nevertheless, a
rigorous justification of the invention must be provided at some stage.
Historically, this came early in the nineteenth century after $i$ had already
had a long and useful innings. For as early as the sixteenth century
mathematicians were experimenting with the square root of minus one, and had
observed that consistent and acceptable results could be obtained, though 
they were unable to justify these procedures. Later mathematicians such 
as Euler (1707-83), continuing the work of John Bernoulli (1667-1748) 
and de Moivre (1667-1754) developed complex numbers as powerful 
mathematical tools; whilst, at the end of the eighteenth century, Wessel 
and Argand gave their celebrated geometrical interpretation. But it was 
not till the beginning of the nineteenth century that Gauss (1777-1855) 
laid a sound logical foundation for complex numbers, and then proceeded 
to develop the theory of functions of a complex variable. 

Now the invention of $i$ to provide solutions for all quadratic equations
may possibly give the impression that further inventions may be necessary 
to solve cubic equations, and possibly that even more weird and wonderful 
numbers might have to be dreamed up to deal with equations of higher 
degree; and what further miracles of man’s genius would be needed to 
assign meanings to such expressions as (2 — 7)‘ and ./[sin (5 — 27)?/9]? 
Happily, such further inventions are not necessary; the extension of the 
number system by the single step of adjoining the square root of —1 
enables all these situations to be covered! The roots of equations of any 
degree of complication, and numbers of any degree of ‘complexity’, can 
be expressed in the same simple form, $x + iy$, where $x$ and $y$ are real, just
like the roots of the quadratic equation $x^2 + 6x + 13 = 0$ that we
examined at the outset. It is this remarkable property of playing a unifying
role in mathematics that endows $i$ with such interest and importance.

Our work in the present chapter will therefore rest on the assumption
of a number $i$ satisfying $i^2 = -1$, and we define a ‘complex number’ to
be the sum of two parts:

\[ z = x + iy, \]

where $x$ and $y$ are real. $x$ is called the ‘real part’ of the complex number,
and the second term, $iy$, is the so-called ‘imaginary part’. We also write

\[ x = \mathcal{R}(z), \qquad iy = \mathcal{I}(z). \]

In the manipulation of complex numbers, we simply assume that the
symbol $i$ will behave exactly as an ordinary number, and that in any
manipulations $i^2$ can be replaced by $-1$.

We observe immediately that the sum and difference of two complex
numbers are also complex numbers; or, as we shall say in 14.2, the set of
complex numbers $\mathbb{C} = \{x + iy : x, y \in \mathbb{R}\}$ is closed under addition and
subtraction. For if $z_1 = x_1 + iy_1$ and $z_2 = x_2 + iy_2$,

then
\[
\begin{aligned}
z_1 + z_2 &= x_1 + iy_1 + x_2 + iy_2 \\
&= (x_1 + x_2) + i(y_1 + y_2),
\end{aligned}
\]

which is also a complex number; and similarly for subtraction. It is less
obvious that the set is closed under multiplication and division.

However
\[ z_1 z_2 = (x_1 + iy_1)(x_2 + iy_2) = (x_1 x_2 - y_1 y_2) + i(x_1 y_2 + x_2 y_1), \]

while
\[
\begin{aligned}
\frac{z_1}{z_2} = \frac{x_1 + iy_1}{x_2 + iy_2} &= \frac{(x_1 + iy_1)(x_2 - iy_2)}{(x_2 + iy_2)(x_2 - iy_2)} \\
&= \frac{x_1 x_2 + y_1 y_2}{x_2^2 + y_2^2} + i\, \frac{x_2 y_1 - x_1 y_2}{x_2^2 + y_2^2},
\end{aligned}
\]

both of which are in the form $x + iy$. Thus the repeated processes of
addition, subtraction, multiplication and division by numbers other than
zero will lead every time to another complex number, and the system is
self-contained.
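The product and quotient formulae can be applied mechanically to ordered pairs $(x, y)$ standing for $x + iy$. The sketch below assumes integer parts and uses exact fractions; the names `multiply` and `divide` are illustrative and not taken from the text.

```python
from fractions import Fraction


def multiply(z1, z2):
    """(x1 + i y1)(x2 + i y2) as a pair (real part, coefficient of i)."""
    (x1, y1), (x2, y2) = z1, z2
    return (x1 * x2 - y1 * y2, x1 * y2 + x2 * y1)


def divide(z1, z2):
    """(x1 + i y1)/(x2 + i y2) for integer parts, returned as exact fractions."""
    (x1, y1), (x2, y2) = z1, z2
    denom = x2 * x2 + y2 * y2          # (x2 + i y2)(x2 - i y2), a real number
    if denom == 0:
        raise ZeroDivisionError("division by the complex number zero")
    return (Fraction(x1 * x2 + y1 * y2, denom),
            Fraction(x2 * y1 - x1 * y2, denom))


if __name__ == "__main__":
    z1, z2 = (7, -3), (-12, -16)
    q = divide(z1, z2)
    print(q)                 # the quotient as a pair of fractions
    print(multiply(q, z2))   # multiplying back should recover z1
```

Multiplying the quotient by the divisor recovers the original pair, which illustrates that the four operations never lead outside the form $x + iy$.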

Moreover, we note that two complex numbers are equal if and only if
their real and imaginary parts are separately equal:

For
\[
\begin{aligned}
z_1 = z_2 \quad &\Rightarrow\quad x_1 - x_2 = i(y_2 - y_1) \\
&\Rightarrow\quad (x_1 - x_2)^2 = -(y_2 - y_1)^2 \\
&\Rightarrow\quad (x_1 - x_2)^2 + (y_1 - y_2)^2 = 0 \\
&\Rightarrow\quad x_1 = x_2 \ \text{and}\ y_1 = y_2
\end{aligned}
\]

(since each of the perfect squares of real numbers must be positive or zero).
The converse is trivial; and we have also the corollary that

\[ z = 0 \quad\Rightarrow\quad x = 0 \ \text{and}\ y = 0. \]

Example
In each of the following cases express the given number in the form $x + iy$:

(i)
\[
\begin{aligned}
\frac{7 - 3i}{(2 - 4i)^2} &= \frac{7 - 3i}{4 - 16i + 16i^2} = \frac{7 - 3i}{-16i - 12} = \frac{7 - 3i}{-4(3 + 4i)} \\
&= \frac{(7 - 3i)(3 - 4i)}{-4(3 + 4i)(3 - 4i)}
\end{aligned}
\]
(multiplying top and bottom by $3 - 4i$ in order to obtain
a real denominator)
\[ = \frac{21 - 37i + 12i^2}{-4(9 - 16i^2)} = \frac{9 - 37i}{-100} = -\frac{9}{100} + \frac{37}{100}\, i. \]
(ii)
\[ \frac{1}{(1 + i)^3} = \frac{1}{1 + 3i + 3i^2 + i^3} = \frac{1}{1 - 3 + 3i - i} = \frac{1}{-2 + 2i} = \frac{-2 - 2i}{(-2 + 2i)(-2 - 2i)} = \frac{-2 - 2i}{8} = -\tfrac14 - \tfrac14 i. \]
Ce πε ee | ee 
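(iii)
\[
\begin{aligned}
\frac{z}{z - 1} = \frac{x + iy}{x - 1 + iy} &= \frac{(x + iy)(x - 1 - iy)}{(x - 1 + iy)(x - 1 - iy)} \\
&= \frac{x^2 - x + y^2 - iy}{(x - 1)^2 + y^2} \\
&= \frac{x^2 - x + y^2}{(x - 1)^2 + y^2} - i\, \frac{y}{(x - 1)^2 + y^2};
\end{aligned}
\]
or we might proceed:
\[ \frac{z}{z - 1} = 1 + \frac{1}{z - 1} = 1 + \frac{x - 1 - iy}{(x - 1 + iy)(x - 1 - iy)}, \ \text{etc.} \]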


Exercise 10.1 

1 Express the following in the form $x + iy$:
ἢ) (—l—12) + (-2 — δὴ); 

a) (4 — 31) — (6 + 51); 

11) St(1 ~ 21): 


τσ τς 
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(xz) (x — 1)" (xe R); 
(xii) (x + ty)° (x, ye R); 


τῶ: Ι 
(χ111) Use the result of (x) to simplify -and * 
a) 7 +1 
2 Express in the form x + 1y: 
| Ι ΒΕ. ες | 
Fay Ngee τα ay 
segs? ah RA 1.3... 
. : l =—3 
(av) 3 ο: (v) 7 (vz) (1 + 21) 
(2—7)*? (147) 1 


ὧν cos @ -- ἰ5ῖπθ. 
3 Find integers $p$, $q$ such that $(3 + 7i)(p + iq)$ is purely imaginary.


4 If $z = x + iy$ ($x, y \in \mathbb{R}$), express in the form $X + iY$:
he Ae Sse Ι 
(1) Σ΄: Ὁ: (11) Ζ - -- 
Zz 


5 Show that $(4 - 5i)(2 + 3i) = 23 + 2i$, and deduce factors of $23 - 2i$.
Hence show that $(4^2 + 5^2)(2^2 + 3^2) = 23^2 + 2^2$.
Use a similar method to express $(4^2 + 7^2)(9^2 + 5^2)$ as the sum of two
squares.

Ζ Σὰ ©) 
344° δ᾽ 8 .- 4 


6 Solve the equation for z: 


7 Find real values of $x$ and $y$ such that $(1 + 3i)x + (2 - 5i)y + 2i = 0$.
8 Evaluate $\sqrt{1 + i}$ and $\sqrt{5 - 12i}$ by letting the result be $x + iy$ in
each case.
9 Verify that $x = 2$, $y = -1$ satisfy the simultaneous equations
\[ x^3 - 3xy^2 = 2, \qquad 3x^2 y - y^3 = -11. \]
Hence find the cube root of $2 - 11i$.
10 Find real values of χ and y to satisfy 

x y —5 + 6 

ae =p oe 

l-2 3-21 Ι - 8) 


11 Solve the equations:

(i) $z^2 + 4z + 29 = 0$; (ii) $4z^2 - 12z + 25 = 0$;

(wi) 227 + 2iz+1=0; (0) 2 z= 2; 

(v) χ᾽ +1027 + 372 +42=0; (wi) (2 Ὁ ἐ)χῇ —-z+ ὦ -- ἢ Ξ 0. 


12 Find the quadratic equations whose roots are:
(i) $3 + \sqrt5$, $3 - \sqrt5$; (ii) $-3 + i$, $-3 - i$;
(iii) $2 + \sqrt3\, i$, $2 - \sqrt3\, i$; (iv) $-i$, $1 + 2i$.
13 Find the sum and the product of the roots $z_1$, $z_2$ of the equations
(i) z27#—-5z+7=0; it) 2 ee eT 10. 
Obtain in each case the value of z¢ + z5. What do you notice, and how 
do you account for it? 
14 Sum the series: 
(1) Llti+?%4P4---4%; 
(1) 1 — 2] + 4132 — 81"  ---- (—27)""'; 
(111) 1 + 21 + 317 + 41) +--+ 4+ ἡ]. 

B 


1 χ-- 


15 Express in partial fractions in the form 


x? + ] 


R t for th . 
epca or e expression ee ΜῈ ἢ 


10.2 The Argand diagram 


We have seen that any complex number consists essentially of a pair of real
numbers: $z = x + iy$, where $x$ and $y$ are real. In view of this aspect of a
complex number as an ordered pair of real numbers, it is natural that we
should choose to represent them by the points in a plane using a pair of
coordinate axes Ox, Oy, and by assigning the complex number $x + iy$ to
the point whose coordinates are $(x, y)$. It is clear that this sets up a 1, 1
correspondence between the complex numbers and the points of the
Cartesian plane. For example,


$i$ is represented by the point $(0, 1)$,
πα 1120} Ὁ} 
$-5i$ is represented by the point $(0, -5)$,
$-4$ is represented by the point $(-4, 0)$,

and conversely, the point $(1, -3)$ represents the complex number $1 - 3i$,
and so on. This method of representing the complex numbers is referred
to as the Argand diagram after the French mathematician J. R. Argand
(1768–1822). Ox and Oy are called the ‘real’ and ‘imaginary’ axes, for
those complex numbers of the form $x + 0i$, i.e., the real numbers, are
represented by points of the x-axis, whose role is thus seen to be simply
that of the real number line; while those complex numbers whose ‘real
parts’ are zero appear in the Argand diagram on the y-axis, and are
sometimes loosely referred to as ‘purely imaginary’, among them being the
number $i$ itself.

We have already emphasised that it is a mistake to try and think of $i$ as
a number in the sense of a ‘magnitude’, and it is only after the Argand
diagram has been established that it becomes possible to acquire any sort of
visual insight into the nature of $i$.

If $i$ is regarded as a geometric operator, or an instruction to perform a
certain geometrical transformation, then the equation $i^2 = -1$ may be
translated to mean that the operation $i$, when performed twice in
succession, is equivalent to the operation represented by multiplication by the
number $-1$, which causes a reversal of direction. For if $P(x, y)$ represents
$x + iy$, then the effect of multiplication by $-1$ is to replace $x + iy$ by
$-x - iy$ (see p. 106 ‘negative’), and this reverses the vector OP, rotating
it through 180°. Since, therefore, $i^2$ causes a 180° rotation in the plane, it
is a short step to interpret $i$ as a 90° rotation, for then the equation $i^2 = -1$
states that two successive quarter-turns are equivalent to a half-turn.
Whether the quarter-turn is clockwise or anti-clockwise is arbitrary; with
the usual choice of Cartesian axes, it will be observed that, if U represents
the ‘unit’ point, $1 + 0i$ (or the real number 1), then OU must be given
an anti-clockwise quarter-turn to take up the position OI, where I
represents the number $i$, so that with the usual system of axes the sense of the
90° rotation representing $i$ is anti-clockwise.

[Figure: the points U and I on the real and imaginary axes.]

More generally, if $z = x + iy$,
then $iz = ix + i^2 y = -y + ix$,

so again we see that multiplication of any complex number by $i$ leads to a
rotation of the vector through 90°. What is the effect of multiplication
by $-i$?
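The quarter-turn can be observed directly with Python's built-in complex numbers. In this sketch the function name `times_i` is an assumption introduced for illustration; the library calls `abs`, `cmath.phase` and `math.degrees` are standard.

```python
import cmath
import math


def times_i(z):
    """Multiply the complex number z by i."""
    return 1j * z


if __name__ == "__main__":
    z = complex(3, 1)
    w = times_i(z)
    print(w)                                              # (-1+3j), i.e. the point (-1, 3)
    print(abs(z), abs(w))                                 # the two moduli
    print(math.degrees(cmath.phase(w) - cmath.phase(z)))  # angle turned through, in degrees
    print(times_i(w), -z)                                 # two quarter-turns against one half-turn
```

The modulus is unchanged, the direction turns through a right angle anti-clockwise, and applying the operation twice gives $-z$.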


Addition and subtraction in the Argand diagram
If $z_1 = x_1 + iy_1$ and $z_2 = x_2 + iy_2$,
then $z_1 + z_2 = (x_1 + x_2) + i(y_1 + y_2)$.

[Figure: parallelogram with vertices $O$, $z_1$, $z_2$ and $z_1 + z_2$.]

So complex numbers are added like displacements, or like position vectors,
according to a parallelogram law. Indeed, the above equation might have
been written as

\[ \begin{pmatrix} x_1 \\ y_1 \end{pmatrix} + \begin{pmatrix} x_2 \\ y_2 \end{pmatrix} = \begin{pmatrix} x_1 + x_2 \\ y_1 + y_2 \end{pmatrix}; \]

this means we can regard complex numbers in the Argand diagram to be
represented as well by position vectors as by points. But it must be
emphasised that a complex number $z$ corresponds not only to OZ but to the whole
class of equivalent vectors which are equal and parallel to OZ (or, as we
shall say in Chapter 14, the additive group of complex numbers is
isomorphic with that of the two-dimensional vectors).


Continued addition

When several complex numbers are being added, there is no need for
them to be combined laboriously in pairs using the parallelogram law
repeatedly. Any number of vectors may be added by ‘nose-to-tail’ addition,
the order in which they are added being immaterial in view of the
associativity and commutativity of vector addition. Such an addition of complex
numbers $z_1$, $z_2$, $z_3$, $z_4$ is illustrated by the vector polygons overleaf.

[Figure: vector polygons for the sum of $z_1$, $z_2$, $z_3$, $z_4$ taken in three different orders, including (i) $z_1 + z_2 + z_3 + z_4$ and (iii) $z_4 + z_3 + z_1 + z_2$.]
The negative of a complex number

If $z = x + iy$, then $-z = -x - iy$, so that $-z$ is obtained geometrically
from $z$ by means of a half-turn about the origin.

[Figure: the points $z$ and $-z$.]

The subtraction of complex numbers

When we say, in the field of real numbers, that $7 - 4 = 3$, this is open to
two interpretations:

(i) 3 is the number which, when added to 4, makes 7: $3 + 4 = 7$.
More generally, $a - b = c \Leftrightarrow b + c = a$;

(ii) 3 is the sum of 7 and $-4$: $7 + (-4) = 3$.
More generally, $a - b = a + (-b)$, or subtraction is equivalent to
the addition of the negative, or additive inverse (see 14.1).

Similarly in the field of complex numbers, $z_1 - z_2$ has two
interpretations, each of which can be seen in the Argand diagram:

(i) $z_1 - z_2$ is represented by that vector which, when added to $z_2$, gives
$z_1$, and (ii) $z_1 - z_2$ is the sum of $z_1$ and $(-z_2)$, and when $z_1$ and $(-z_2)$
are added by the parallelogram law to give $z_1 - z_2$ it is immediately clear
that the vector joining the origin to this point is equivalent to that joining
$Z_2$ to $Z_1$.

[Figure: the two constructions of $z_1 - z_2$.]

The first of the two geometrical interpretations is more useful in practice,
and we may say generally that, if complex numbers $a$, $b$, $c$, ... are
represented by points A, B, C, ..., then

$a - b$ is represented by $\mathbf{BA}$,
$b - a$ is represented by $\mathbf{AB}$ (note the ‘reversal’),
$q - p$ is represented by $\mathbf{PQ}$,
and so on.


Multiplication by a real number


It is evident in general that if two complex numbers $z_1$, $z_2$ are so situated
that the vectors $\mathbf{OZ}_1$ and $\mathbf{OZ}_2$ have the same direction, then one is a real
multiple of the other. For example, if $z_1 = 3 + 5i$, and $z_2 = 9 + 15i$,
then $\mathbf{OZ}_2 = 3\,\mathbf{OZ}_1$, these vectors having the same direction. Similarly, if
$z_3 = -1\tfrac12 - 2\tfrac12 i$ then we have $\mathbf{OZ}_3 = -\tfrac12\, \mathbf{OZ}_1$: these two vectors have
the same direction, but the minus sign informs us of their opposite senses.
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Exercise 10.2a 

1 Plot the following points on the Argand diagram: 
(1)1 --; (1) —2 + 32; (a2) —2 — 32; (w ᾿ oer 


(v) ἢ Ὁ 5: 
(vi) οο5 ἐπ + ἐϑβϑίη ἐπ; (viz) οο5 π- isin π; (viii) δεῖ aia 


2 If z= 1 +i, simplify z*, z*,z*,..., and mark these points on the 
Argand diagram. 


3 Ifz, = 2 —1,z, = 1 + 91, mark on the Argand diagram the points 


z 
215 295 Ζ᾽ + Zo; 24 ae 295 Zo =o Ζ1. 2129, tk: aia | 215 etc. 


] 
z, wz, z—1, 3z, —3z, — 2”. 
Ζ 
Mark also the points ζ΄, —z’ and zz’ where z’ = —5 -- 121. 


5 Show that the points $a$, $b$, $c$, $d$ ($\in \mathbb{C}$) form a parallelogram if and only
if the sum of two of them is equal to the sum of the other two.


6 Give a geometrical interpretation of $a - b = \pm i(c - d)$ where
$a, b, c, d \in \mathbb{C}$.


7 If A, B represent complex numbers a, 5, obtain the points which 
represent: 


(i) Ha+b); () $a +4; (ii) a+ 4(b— a); (iv) b+ Ha -- ὃ). 
8 ABCD 15 a rhombus with AC = 2BD. If b = 3 +1,d = 1 — 91, find 


aand ¢. 


9 The centre of a square is the point —2 + 2, and one vertex is 1 + 91. 
Find the others. 


10 ABCD is a square, with a = 3 + 1, ὁ = 4 — 22. Find possible posi- 
tions of C and D. 


1x ABCD isa square with a = —2 + 51,¢ = 4 — 81. Find ὁ and d. 


2 Ifa=2—i1,b = 4 — 31, find two values of ¢ so that triangle ABC 
is equilateral. 


Division of a line in a given ratio 


If P is the mid-point of AB, it is also the mid-point of the other diagonal of
the parallelogram formed by OA and OB as adjacent sides, so that
$p = \tfrac12 (a + b)$.

More generally, if P divides AB in the ratio $\beta : \alpha$, then since AP and PB
have the same direction, and their lengths are in the ratio $\beta : \alpha$, it follows that

\[ \alpha\, \mathbf{AP} = \beta\, \mathbf{PB} \]
so that
\[ \alpha (p - a) = \beta (b - p). \]

Hence
\[ (\alpha + \beta) p = \alpha a + \beta b \quad\Rightarrow\quad p = \frac{\alpha a + \beta b}{\alpha + \beta}. \]

Note that (a) when $\alpha = \beta = 1$, we get $p = \tfrac12 (a + b)$ as the mid-point
of AB; (b) when either $\alpha$ or $\beta$ is negative, the point P will lie outside the
line segment AB, in the extension of either AB or BA; (c) we must exclude
the case $\alpha = -\beta$; (d) we choose to name the two parts of the ratio so that
$\beta$ and $\alpha$ are reckoned from A and B respectively. The reason for this is
explained elsewhere (see 11.1). Briefly, P is the centre of mass of $\alpha$ at A
and $\beta$ at B respectively.


Centroid of a triangle


If G is the centroid of triangle ABC whose medians are AP, BQ, CR, we
have $p = \tfrac12 (b + c)$. Now G divides each median in the ratio 1:2 so that

\[ PG : GA = 1 : 2 \quad\Rightarrow\quad g = \tfrac23 p + \tfrac13 a = \tfrac23 \times \tfrac12 (b + c) + \tfrac13 a = \tfrac13 (a + b + c). \]

Equating real and imaginary parts would give us the coordinates of G as
\[ \left[ \tfrac13 (x_1 + x_2 + x_3),\ \tfrac13 (y_1 + y_2 + y_3) \right], \]

where A, B, and C are $(x_1, y_1)$, $(x_2, y_2)$, and $(x_3, y_3)$.
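The ratio formula and the centroid result can be illustrated with Python's complex type. The function names `divide_in_ratio` and `centroid`, and the sample triangle, are assumptions introduced for this sketch.

```python
def divide_in_ratio(a, b, beta, alpha):
    """Return the point p on the line AB with AP : PB = beta : alpha."""
    if alpha + beta == 0:
        raise ValueError("the case alpha = -beta must be excluded")
    return (alpha * a + beta * b) / (alpha + beta)


def centroid(a, b, c):
    """Centroid of the triangle ABC, g = (a + b + c)/3."""
    return (a + b + c) / 3


if __name__ == "__main__":
    a, b, c = complex(0, 0), complex(6, 0), complex(0, 9)
    p = divide_in_ratio(b, c, 1, 1)      # mid-point of BC
    g = divide_in_ratio(p, a, 1, 2)      # PG : GA = 1 : 2
    print(g, centroid(a, b, c))
    # A negative part of the ratio places the point outside the segment.
    print(divide_in_ratio(complex(0, 0), complex(6, 0), 3, -1))
```

The first line prints the same point twice, once reached along the median AP and once from $\tfrac13 (a + b + c)$.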


Exercise 10.2b 


1 Find the centroid of the triangle ABC where a = --21, ὃ = 4+ 1, 
c= —10 — 3. 

2 ABCD is any quadrilateral. Prove by using complex numbers that the 
lines joining the mid-points of AB and CD, of AC and BD, and of AD and 


BC all bisect each other. 


3 ABCD isa parallelogram. P is the mid-point of BC, BD and AP inter- 
sect at F. Prove by using complex numbers that F trisects both BD and AP. 


4 Given any three complex numbers $a$, $b$, $c$, show that there exist real
numbers $\alpha$, $\beta$, $\gamma$ such that $a\alpha + b\beta + c\gamma = 0$. Show further that A, B,
and C are collinear when $\alpha + \beta + \gamma = 0$.


5 The triangle ABC is equilateral. Show that
\[ c = \tfrac12 (a + b) \pm \tfrac12 \sqrt3\, i\, (a - b) \]

and express $c$ also in terms of $\omega$ where $\omega = \cos \tfrac23\pi + i \sin \tfrac23\pi$. Find also
the complex number representing the centre of the equilateral triangle and
show that

\[ a^2 + b^2 + c^2 - bc - ca - ab = 0 \]

is a necessary and sufficient condition for the triangle ABC to be
equilateral (which may be used to prove ‘Napoleon’s theorem’, see 10.2).

6 Given points A, B, C representing complex numbers a, ὃ, ὁ, show how 
to construct the positions of: 

a+t b, 3a + 20, b — 2a, a— C, atb4+e, 

atb-—e, a+ 2b — 3e. 
Polar form, modulus, and argument


We have already seen that the addition of complex numbers has an
immediate interpretation on the Argand diagram in terms of the addition
of vectors. When it comes to multiplication, if $z = z_1 z_2$, then

$$z = z_1 z_2 = (x_1 + iy_1)(x_2 + iy_2) = (x_1 x_2 - y_1 y_2) + i(x_1 y_2 + x_2 y_1),$$

so that

$$x = x_1 x_2 - y_1 y_2, \qquad y = x_1 y_2 + x_2 y_1.$$

These formulae for the Cartesian coordinates of the point representing the
product do not help us to locate it readily on the Argand diagram, and in
order to provide a simple interpretation it is necessary to use polar
coordinates.




We have, for all non-zero values of $z$ $(= x + iy)$,

$$x = r\cos\theta, \qquad y = r\sin\theta,$$

where $r$ (= the distance OZ) is called the modulus of $z$, written $|z|$,
and $\theta$ $(= \angle XOZ)$ is called the argument (or amplitude in some older
books), written $\arg z$.


Hence $z = x + iy$ may now be expressed as $z = r(\cos\theta + i\sin\theta)$, which
we shall usually abbreviate to $z = r\operatorname{cis}\theta$,

where $r = |z| = \sqrt{x^2 + y^2}$, and $\theta = \arg z = \tan^{-1}\dfrac{y}{x}$.


(It must be emphasised that, although when working with polar coordinates
it is possible for $r$ to be negative, here $r$ is always positive.)
So, for example, if $z = 2 - 5i$,

then $|z| = \sqrt{2^2 + (-5)^2} = \sqrt{29}$,
$\arg z$ (in fourth quadrant) $= \tan^{-1}(-5/2) = 291^\circ 48'$

and $2 - 5i = \sqrt{29}\,(\cos 291^\circ 48' + i\sin 291^\circ 48') = \sqrt{29}\operatorname{cis} 291^\circ 48'$.


But there is clearly an infinity of possible values of $\arg z$, such as $651^\circ 48'$,
$1011^\circ 48'$, $-68^\circ 12'$, $-428^\circ 12'$, etc. (all of which are equivalent modulo
$360^\circ$); so of these we usually select the principal value $\theta$, such that
$-\pi < \theta \le \pi$, the principal value of the argument in the example above
being $-68^\circ 12'$.

Similarly, with the complex number $-2i$,

$|-2i| = 2$, and $\arg(-2i) = -\tfrac12\pi$.
So $-2i = 2\left(\cos(-\tfrac12\pi) + i\sin(-\tfrac12\pi)\right) = 2\operatorname{cis}(-\tfrac12\pi)$.


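Summarising,

$$z = x + iy = r(\cos\theta + i\sin\theta) = r\operatorname{cis}\theta,$$

$\operatorname{Re} z = x = r\cos\theta$, $\qquad r = |z| = \sqrt{x^2 + y^2}$ (modulus),

$\operatorname{Im} z = y = r\sin\theta$, $\qquad \theta = \arg z = \tan^{-1}(y/x)$ (argument)

(and for its principal value, $-\pi < \theta \le \pi$).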
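
The conversion between Cartesian and polar form can be tried out with Python's standard `cmath` module. This is a minimal sketch under the assumption that arguments are wanted in degrees; the helper names `to_polar_degrees` and `cis` are illustrative, not part of the text.

```python
import cmath
import math


def to_polar_degrees(z: complex) -> tuple[float, float]:
    """Return (modulus, principal argument in degrees) of a non-zero z."""
    r, theta = cmath.polar(z)      # theta is the principal value in radians
    return r, math.degrees(theta)


def cis(theta: float) -> complex:
    """cis(theta) = cos(theta) + i sin(theta), theta in radians."""
    return complex(math.cos(theta), math.sin(theta))


r, deg = to_polar_degrees(2 - 5j)
print(r, deg)                      # modulus sqrt(29); argument about -68.2 degrees
z_back = r * cis(math.radians(deg))
print(z_back)                      # recovers 2 - 5i up to rounding
```

Note that `cmath.polar` reports the principal value, so the example $2 - 5i$ gives roughly $-68^\circ 12'$ rather than $291^\circ 48'$.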


Example 1 


Express in polar form:

(i) $-\sqrt3 + i$; (ii) $2 - 3i$; (iii) $\dfrac{4 + 3i}{12i - 5}$; (iv) $\dfrac{(i - 2)^2(3i + 1)}{i - 3}$.

In all these examples, and in the following exercises, it is a great help to
plot the complex numbers on the Argand diagram, as this enables the
modulus and argument readily to be 'read off'.

(i) $-\sqrt3 + i = 2\left(-\tfrac12\sqrt3 + \tfrac12 i\right) = 2(\cos 150^\circ + i\sin 150^\circ)$;

(ii) $|2 - 3i| = \sqrt{2^2 + 3^2} = \sqrt{13}$,
$\arg(2 - 3i) = \tan^{-1}(-\tfrac32)$ (4th quadrant) $= -56^\circ 19'$.
So $2 - 3i = \sqrt{13}\operatorname{cis}(-56^\circ 19')$;

(iii) $$\frac{4 + 3i}{12i - 5} = \frac{(4 + 3i)(-5 - 12i)}{(-5 + 12i)(-5 - 12i)} = \frac{16 - 63i}{169} = \frac{65}{169}\left(\frac{16}{65} - \frac{63}{65}i\right) \quad (\text{for } 16^2 + 63^2 = 65^2)$$
$$= \frac{5}{13}\operatorname{cis}\alpha,$$
where $\alpha = \tan^{-1}(-\tfrac{63}{16})$ in the fourth quadrant;

(iv) $$\frac{(i - 2)^2(3i + 1)}{i - 3} = \frac{(3 - 4i)(1 + 3i)}{-3 + i} = \frac{(15 + 5i)(-3 - i)}{(-3 + i)(-3 - i)} = \frac{-40 - 30i}{10} = -4 - 3i = 5\operatorname{cis} 216^\circ 52'.$$


Example 2 


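If $z = \operatorname{cis}\theta$, express in polar form:

(i) $1 - z$; (ii) $\dfrac{2z}{1 - z^2}$; (iii) $\dfrac{z + 1}{z - 1}$.

(i) $$1 - \cos\theta - i\sin\theta = 2\sin^2\tfrac12\theta - 2i\sin\tfrac12\theta\cos\tfrac12\theta = 2\sin\tfrac12\theta\,(\sin\tfrac12\theta - i\cos\tfrac12\theta) = 2\sin\tfrac12\theta\operatorname{cis}(-\tfrac12\pi + \tfrac12\theta);$$

(ii) $$\frac{2z}{1 - z^2} = \frac{2\operatorname{cis}\theta}{1 - (\cos\theta + i\sin\theta)^2} = \frac{2\operatorname{cis}\theta}{1 - \cos^2\theta - 2i\sin\theta\cos\theta + \sin^2\theta} = \frac{2\operatorname{cis}\theta}{2\sin\theta\,(\sin\theta - i\cos\theta)}$$
$$= \frac{(\cos\theta + i\sin\theta)(\sin\theta + i\cos\theta)}{\sin\theta\,(\sin\theta - i\cos\theta)(\sin\theta + i\cos\theta)} = \frac{i(\cos^2\theta + \sin^2\theta)}{\sin\theta\,(\sin^2\theta + \cos^2\theta)} = \frac{i}{\sin\theta};$$

(iii) $$\frac{z + 1}{z - 1} = \frac{\operatorname{cis}\theta + 1}{\operatorname{cis}\theta - 1} = \frac{\cos\theta + i\sin\theta + 1}{\cos\theta + i\sin\theta - 1} = \frac{2\cos^2\tfrac12\theta + 2i\sin\tfrac12\theta\cos\tfrac12\theta}{-2\sin^2\tfrac12\theta + 2i\sin\tfrac12\theta\cos\tfrac12\theta}$$
$$= \frac{\cos\tfrac12\theta\,(\cos\tfrac12\theta + i\sin\tfrac12\theta)}{-\sin\tfrac12\theta\,(\sin\tfrac12\theta - i\cos\tfrac12\theta)} = \frac{-\cot\tfrac12\theta\,(\cos\tfrac12\theta + i\sin\tfrac12\theta)(\sin\tfrac12\theta + i\cos\tfrac12\theta)}{\sin^2\tfrac12\theta + \cos^2\tfrac12\theta} = -i\cot\tfrac12\theta.$$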


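Note that the two questions above may be done more easily by using the
device:

$$\operatorname{cis}\theta + 1 = \operatorname{cis}\tfrac12\theta\,\left(\operatorname{cis}\tfrac12\theta + \operatorname{cis}(-\tfrac12\theta)\right) = \operatorname{cis}\tfrac12\theta \times 2\cos\tfrac12\theta = 2\cos\tfrac12\theta\operatorname{cis}\tfrac12\theta,$$

$$\operatorname{cis}\theta - 1 = \operatorname{cis}\tfrac12\theta\,\left(\operatorname{cis}\tfrac12\theta - \operatorname{cis}(-\tfrac12\theta)\right) = \operatorname{cis}\tfrac12\theta \times 2i\sin\tfrac12\theta = 2i\sin\tfrac12\theta\operatorname{cis}\tfrac12\theta = 2\sin\tfrac12\theta\operatorname{cis}(\tfrac12\theta + \tfrac12\pi).$$

We shall return to these again on pp. 122, 137.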


Inequalities 


Since $z_1 + z_2$ is represented by a side of the triangle, it is immediately
clear that

$$|z_1 + z_2| \le |z_1| + |z_2|$$

(and the equality would only occur if the triangle collapsed into a straight
line).

More generally, we see that

$$|z_1 + z_2 + \cdots + z_n| \le |z_1| + |z_2| + \cdots + |z_n|,$$

while the two further results

$$|z_1 - z_2| \ge |z_1| - |z_2| \quad\text{and}\quad |z_1 - z_2| \le |z_1| + |z_2|$$

also have simple geometrical counterparts.


Exercise 10.2c 
1 Express the following complex numbers in the polar form
$r(\cos\theta + i\sin\theta)$, giving the principal value of the argument, and
illustrating on the Argand diagram:


@) --ι; (@) -- ἐ; Gu) 1+ «γ81; (ὦ) --ἰ; () -- 12 — 51; 


(vw) —3 21; (vz) sina + 2cosa; (viz) sina -- icosa. 
2 Express in the form $x + iy$:
: . 1 οὖ ; aw π ᾽ ‘ 
(2) 4 cis τ᾽ (11) Scism; (11) 10 cis +3 ) (wv) 8 cis 165°; 


71 
(v) 2 cis 800°; (uw ) 6 cis -τ 


3 Ifa=2+121, b= 14 32, mark in the Argand diagram the points 
representing : 


; (ὦ) α; (ἡ) δ; (wi) a? + δ᾽; 


Ὁ | & 


(i) αὐ; (ii) ; ες (ἰὴ) 
and find the modulus and argument in each case. 
4 Ifa =cis$nmand ὁ = 2 cis(—4n), evaluate: 

(i) a2,a3,...; (ii) 62, B8,...3 (iit) αὐ, 


Give the principal value of the arguments and mark each point in the 
Argand diagram. 


5 Simplify cis 0 x cis ᾧ. Use the result to write down 

ae : bs, es 

(1) cis = x cis ἐς; (22) (cis 809). 

6 3 

6 Solve the equations, giving the roots in polar form: 
(() z2—-z+1=0; (i) 2+z+1=0; 
(11) 927 122 +5=0; (w) χῇ --Ζ-ἘἸ --τῤἨ Ξ (. 
7 Find quadratic equations whose roots are: 

' γῶν «τὰς ee π 
(1) cis 120°, cis 240°; (72) cis τ’ ols τττ; 
(111) cosa + ising, οοβα -- 1510 ἃ. 


8 By using the identity $(a^2 + b^2)(c^2 + d^2) = (ac - bd)^2 + (bc + ad)^2$,
prove that $|z_1|\,|z_2| = |z_1 z_2|$.

9 Using only the definition $|z|^2 = x^2 + y^2$, prove that
$|z_1 + z_2| \le |z_1| + |z_2|$.

10 If $|z_1| = |z_2|$, prove that $(z_1 + z_2)/(z_1 - z_2)$ is purely imaginary.
Interpret geometrically.

11 Prove that $|z_1 + z_2|^2 + |z_1 - z_2|^2 = 2|z_1|^2 + 2|z_2|^2$, and give a
geometrical interpretation of this result.

12 Show that the equation |z| = 1 represents a circle of unit radius and 
centre O. Describe what the following equations represent: 

Cy SaaS; (1) |z + 1| = |z — 27; 
(ia) |z + 31 « 2; (iv) [1 <4 and |z—- 25 1;
(v) [2 — 3{ + [z + 3] = 10. 


12 Illustrate the regions: 

(¢) jz ΞΕ > Iz +d; (ii) |z}) > 3 > |z — 3}. 

14 Given that $|z - 2 - i| \le 5$, show geometrically that
$8 \le |z - 14 - 6i| \le 18$.


15 Prove that 5 


Multiplication of complex numbers 


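We admitted earlier that the rule for multiplication, when expressed in
Cartesian form $x + iy$, is cumbersome and does not lend itself to simple
geometrical interpretation. When the polar form is used, however, a much
simpler result emerges:

$$\operatorname{cis}\theta_1 \times \operatorname{cis}\theta_2 = (\cos\theta_1 + i\sin\theta_1)(\cos\theta_2 + i\sin\theta_2)$$
$$= (\cos\theta_1\cos\theta_2 - \sin\theta_1\sin\theta_2) + i(\sin\theta_1\cos\theta_2 + \cos\theta_1\sin\theta_2)$$
$$= \cos(\theta_1 + \theta_2) + i\sin(\theta_1 + \theta_2) = \operatorname{cis}(\theta_1 + \theta_2).$$

So $\operatorname{cis}\theta_1\operatorname{cis}\theta_2 = \operatorname{cis}(\theta_1 + \theta_2)$.

Hence if $z_1 = r_1\operatorname{cis}\theta_1$ and $z_2 = r_2\operatorname{cis}\theta_2$, it follows that

$$z_1 z_2 = r_1\operatorname{cis}\theta_1 \times r_2\operatorname{cis}\theta_2 = r_1 r_2\operatorname{cis}(\theta_1 + \theta_2).$$

So, in multiplying complex numbers, we simply multiply their moduli
and add their arguments:

$$|z_1 z_2| = |z_1|\,|z_2| \quad\text{and}\quad \arg(z_1 z_2) = \arg z_1 + \arg z_2.$$

Evidently the rule may be extended for the product of more than two
complex numbers:

$$z_1 z_2 z_3 = r_1\operatorname{cis}\theta_1 \times r_2\operatorname{cis}\theta_2 \times r_3\operatorname{cis}\theta_3 = r_1 r_2\operatorname{cis}(\theta_1 + \theta_2) \times r_3\operatorname{cis}\theta_3 = r_1 r_2 r_3\operatorname{cis}(\theta_1 + \theta_2 + \theta_3);$$

and more generally, $z_1 z_2 \ldots z_n = r_1 r_2 \ldots r_n\operatorname{cis}(\theta_1 + \theta_2 + \cdots + \theta_n)$.

So $|z_1 z_2 z_3 \ldots z_n| = |z_1|\,|z_2|\,|z_3| \ldots |z_n|$

and $\arg(z_1 z_2 z_3 \ldots z_n) = \arg z_1 + \arg z_2 + \arg z_3 + \cdots + \arg z_n$.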
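
The rule "multiply the moduli, add the arguments" can be sketched in Python with the standard `cmath` module. The function name `multiply_polar` is an illustrative assumption; the sample numbers $2 + i$ and $1 + 3i$ have product $-1 + 7i$.

```python
import cmath


def multiply_polar(z1: complex, z2: complex) -> complex:
    """Multiply by the polar rule: multiply moduli, add arguments."""
    r1, t1 = cmath.polar(z1)
    r2, t2 = cmath.polar(z2)
    return cmath.rect(r1 * r2, t1 + t2)


z1, z2 = 2 + 1j, 1 + 3j
product = multiply_polar(z1, z2)
print(product)   # close to z1 * z2 = -1 + 7i, up to rounding
```

The sum of two principal arguments need not itself be a principal value, so the relation for arguments holds modulo $2\pi$.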
As a general rule, whenever one is concerned with the multiplication of
complex numbers, it is usually advisable to have them in modulus-argument
form. If one wished to evaluate the very simple product $(2 + i)(1 + 3i)$
$(= -1 + 7i)$, it would hardly be worthwhile changing into polars; but
it is nevertheless instructive to do so as an illustration:

$$z_1 = 2 + i = \sqrt5\operatorname{cis} 26^\circ 34'; \qquad z_2 = 1 + 3i = \sqrt{10}\operatorname{cis} 71^\circ 34'$$
$$\Rightarrow\; z_1 z_2 = \sqrt{50}\operatorname{cis}(26^\circ 34' + 71^\circ 34') = \sqrt{50}\operatorname{cis} 98^\circ 8' = 5\sqrt2\operatorname{cis} 98^\circ 8',$$

which will be found to agree with $-1 + 7i$. However, if one wished to
evaluate $(1 + 3i)^7$, it would be well worthwhile using the polar form to
avoid the work with the binomial theorem. Comparing the two methods:

(a) $$(1 + 3i)^7 = 1 + {}^7C_1\,3i + {}^7C_2(3i)^2 + {}^7C_3(3i)^3 + {}^7C_4(3i)^4 + {}^7C_5(3i)^5 + {}^7C_6(3i)^6 + (3i)^7$$
$$= 1 + 21i + 189i^2 + 945i^3 + 2835i^4 + 5103i^5 + 5103i^6 + 2187i^7$$
$$= 1 + 21i - 189 - 945i + 2835 + 5103i - 5103 - 2187i \quad (\text{putting } i^2 = -1)$$
$$= -2456 + 1992i;$$

(b) $$(1 + 3i)^7 = (\sqrt{10}\operatorname{cis} 71^\circ 34')^7 = (\sqrt{10})^7\operatorname{cis}(71^\circ 34' + 71^\circ 34' + \cdots + 71^\circ 34') \quad (\text{adding the argument 7 times})$$
$$= 1000\sqrt{10}\operatorname{cis} 500^\circ 58' = 1000\sqrt{10}\operatorname{cis} 140^\circ 58' = 3162(-0.7767 + 0.6297i) = -2457 + 1991i.$$

The second answer is less accurate, due to the use of 4-figure tables; but
its advantages are plain, and would of course increase if the exponent, 7,
had been larger.
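
The two methods for $(1 + 3i)^7$ can be compared in a few lines of Python. This is an illustrative sketch only: floating-point arithmetic here replaces the 4-figure tables of the text, so the polar answer agrees with the exact value $-2456 + 1992i$ to within rounding.

```python
import cmath
from math import comb

z = 1 + 3j

# (a) Binomial expansion: sum of C(7, k) * (3i)^k for k = 0..7.
binomial = sum(comb(7, k) * (3j) ** k for k in range(8))

# (b) Polar form: raise the modulus to the 7th power, multiply the argument by 7.
r, theta = cmath.polar(z)
polar = cmath.rect(r ** 7, 7 * theta)

print(binomial, polar)   # both close to -2456 + 1992i
```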


The unit circle 


An important special case is that of numbers of unit modulus (i.e., of the
form $z = 1\operatorname{cis}\theta$) lying on the unit circle, $|z| = 1$. These are multiplied
by simply adding their arguments, i.e., by performing a rotation about
the origin on the Argand diagram. This will be clear since

$$z_1 z_2 = \operatorname{cis}\theta_1\operatorname{cis}\theta_2 = \operatorname{cis}(\theta_1 + \theta_2),$$

so that in an Argand diagram $z_1 z_2$ may be obtained either by rotating
about O from position $z_1$ through $\theta_2$, or from $z_2$ through $\theta_1$ (positive
anti-clockwise). In particular, multiplication of any complex number by
$i$ $(= \operatorname{cis} 90^\circ)$ is represented by a positive rotation through a right angle.

This leads us to consider the general interpretation of multiplication on
the Argand diagram. Suppose we consider again the product $z_1 z_2$ where
$z_1 = 2 + i$, $z_2 = 1 + 3i$. Then $z_1 z_2 = \sqrt5\operatorname{cis}\alpha \times \sqrt{10}\operatorname{cis}\beta$, where
$\alpha = 26^\circ 34'$, $\beta = 71^\circ 34'$. If we start with $z_1$, we may multiply by $z_2$ in
two stages: first by $\sqrt{10}$, then by $\operatorname{cis}\beta$. Multiplication by $\sqrt{10}$ clearly
corresponds to an enlargement from the origin by a factor $\sqrt{10}$, and
multiplication by $\operatorname{cis}\beta$ causes the argument to be increased by $\beta$ to the value
$\alpha + \beta$, i.e., corresponds to a rotation through $\beta$, which brings us to the
final position $z_1\sqrt{10}\operatorname{cis}\beta = z_1 z_2$. More generally, multiplication by
$r\operatorname{cis}\theta$ corresponds to enlargement from the origin by a factor $r$, coupled
with a rotation through $\theta$ about O. Evidently the two transformations
(enlargement and rotation, both about O) commute, and the composite
transformation is known as a spiral similarity.


We now see how a geometrical construction may be used to determine
the position of the point Z representing the product $z_1 z_2$ when the points
$Z_1$, $Z_2$ representing $z_1$, $z_2$ are given. Letting U be the unit point, we have
$OU = 1$, $OZ_1 = r_1$, $OZ_2 = r_2$, $OZ = r_1 r_2$, so that $OZ/OZ_2 = OZ_1/OU$.
Since each of the angles $UOZ_1$, $Z_2OZ$ is equal to $\theta_1$, it follows that triangles $UOZ_1$,
$Z_2OZ$ are similar, so that the position of Z may be constructed by making
$\angle Z_2OZ = \angle UOZ_1$ and $\angle OZ_2Z = \angle OUZ_1$.
Reciprocal of a complex number


We have already seen that the reciprocal of the complex number $x + iy$
is $(x - iy)/(x^2 + y^2)$, the result being obtained by the use of the
conjugate $x - iy$. However, it is somewhat easier to consider the reciprocal
of a complex number when it is expressed in polar form. If $z = r\operatorname{cis}\theta$,

$$\frac1z = \frac{1}{r\operatorname{cis}\theta} = \frac{\operatorname{cis}(-\theta)}{r\operatorname{cis}\theta\operatorname{cis}(-\theta)} = \frac{\operatorname{cis}(-\theta)}{r} = \frac1r\operatorname{cis}(-\theta).$$

Note that $\operatorname{cis}\theta$ and $\operatorname{cis}(-\theta)$ are reciprocals, i.e., that
$(\cos\theta + i\sin\theta)(\cos\theta - i\sin\theta) = 1$.


To plot the point on the Argand diagram representing the reciprocal of a
given complex number $z$, we simply make $\angle UOZ' = \theta$, measured clockwise
[since $\arg(1/z) = -\theta$], and $OZ' = 1/r$, where U is the unit point.
Since $OZ/OU = OU/OZ'$, and $\angle UOZ' = \angle ZOU$, triangles ZUO,
UOZ' are similar; so to perform the construction it is only necessary to
make $\angle UOZ' = \angle UOZ$ and $\angle OUZ' = \angle OZU$. Of course, Z represents
the reciprocal of the complex number represented by Z': reciprocals
are interchangeable.
Division of complex numbers


If we require $\dfrac{z_1}{z_2}$, this may be written $z_1 \times \dfrac{1}{z_2}$, and we can now use our
knowledge of reciprocals:

$$\frac{z_1}{z_2} = r_1\operatorname{cis}\theta_1 \times \frac{1}{r_2}\operatorname{cis}(-\theta_2) = \left(r_1 \times \frac{1}{r_2}\right)\operatorname{cis}[\theta_1 + (-\theta_2)] = \frac{r_1}{r_2}\operatorname{cis}(\theta_1 - \theta_2).$$

So the rule for division is to divide the moduli, and subtract the arguments,
which is the reverse of the rule for multiplication. Or we may write:

$$\left|\frac{z_1}{z_2}\right| = \frac{|z_1|}{|z_2|}, \qquad \arg\frac{z_1}{z_2} = \arg z_1 - \arg z_2.$$

This is what one would expect since division is the inverse process of
multiplication. A useful exercise would be to evolve a construction for
arriving at the point representing $\dfrac{z_1}{z_2}$ starting from given positions of $z_1$, $z_2$.
This will in effect be a combination of the processes in the previous sections.
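
The reciprocal and division rules can be sketched in the same way. The function names `reciprocal_polar` and `divide_polar` are illustrative assumptions; the sample quotient uses $(2 + i)(1 + 3i) = -1 + 7i$.

```python
import cmath


def reciprocal_polar(z: complex) -> complex:
    """1/z for non-zero z: modulus 1/r, argument -theta."""
    r, theta = cmath.polar(z)
    return cmath.rect(1 / r, -theta)


def divide_polar(z1: complex, z2: complex) -> complex:
    """z1/z2 for non-zero z2: divide moduli, subtract arguments."""
    r1, t1 = cmath.polar(z1)
    r2, t2 = cmath.polar(z2)
    return cmath.rect(r1 / r2, t1 - t2)


quotient = divide_polar(-1 + 7j, 1 + 3j)
print(quotient)   # close to 2 + i
```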


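Example 3

Evaluate $\dfrac{(i - 2)^2(3i + 1)}{i - 3}$ using polar form (compare p. 113).

$$i - 2 = -2 + i = \sqrt5\operatorname{cis} 153^\circ 26' \;\Rightarrow\; (i - 2)^2 = 5\operatorname{cis} 306^\circ 52'.$$

Also, $3i + 1 = \sqrt{10}\operatorname{cis} 71^\circ 34'$.

So $(i - 2)^2(3i + 1) = 5\sqrt{10}\operatorname{cis} 378^\circ 26'$.

Finally, $i - 3 = -3 + i = \sqrt{10}\operatorname{cis} 161^\circ 34'$

$$\Rightarrow\; \frac{(i - 2)^2(3i + 1)}{i - 3} = \frac{5\sqrt{10}\operatorname{cis} 378^\circ 26'}{\sqrt{10}\operatorname{cis} 161^\circ 34'} = 5\operatorname{cis} 216^\circ 52' = -4 - 3i.$$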
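Example 4

Simplify $\dfrac{1 - \operatorname{cis} n\theta}{1 - \operatorname{cis}\theta}$ and $\left(\dfrac{1 - \operatorname{cis} n\theta}{1 - \operatorname{cis}\theta}\right)^3$.

$$\frac{1 - \operatorname{cis} n\theta}{1 - \operatorname{cis}\theta} = \frac{1 - \cos n\theta - i\sin n\theta}{1 - \cos\theta - i\sin\theta} = \frac{2\sin^2\tfrac12 n\theta - 2i\sin\tfrac12 n\theta\cos\tfrac12 n\theta}{2\sin^2\tfrac12\theta - 2i\sin\tfrac12\theta\cos\tfrac12\theta}$$
$$= \frac{\sin\tfrac12 n\theta\,(\sin\tfrac12 n\theta - i\cos\tfrac12 n\theta)}{\sin\tfrac12\theta\,(\sin\tfrac12\theta - i\cos\tfrac12\theta)} = \frac{\sin\tfrac12 n\theta \times -i(\cos\tfrac12 n\theta + i\sin\tfrac12 n\theta)}{\sin\tfrac12\theta \times -i(\cos\tfrac12\theta + i\sin\tfrac12\theta)}$$
$$= \frac{\sin\tfrac12 n\theta}{\sin\tfrac12\theta} \times \frac{\operatorname{cis}\tfrac12 n\theta}{\operatorname{cis}\tfrac12\theta} = \frac{\sin\tfrac12 n\theta}{\sin\tfrac12\theta}\operatorname{cis}\tfrac12(n - 1)\theta.$$

It follows that
$$\left(\frac{1 - \operatorname{cis} n\theta}{1 - \operatorname{cis}\theta}\right)^3 = \left(\frac{\sin\tfrac12 n\theta}{\sin\tfrac12\theta}\right)^3\operatorname{cis}\tfrac32(n - 1)\theta,$$
the argument being trebled when the number is cubed.

Alternatively,
$$\frac{1 - \operatorname{cis} n\theta}{1 - \operatorname{cis}\theta} = \frac{\operatorname{cis}\tfrac12 n\theta\,[\operatorname{cis}\tfrac12 n\theta - \operatorname{cis}(-\tfrac12 n\theta)]}{\operatorname{cis}\tfrac12\theta\,[\operatorname{cis}\tfrac12\theta - \operatorname{cis}(-\tfrac12\theta)]} = \frac{\operatorname{cis}\tfrac12 n\theta \times 2i\sin\tfrac12 n\theta}{\operatorname{cis}\tfrac12\theta \times 2i\sin\tfrac12\theta} = \frac{\sin\tfrac12 n\theta}{\sin\tfrac12\theta} \times \operatorname{cis}\tfrac12(n - 1)\theta.$$

{Writing $1 + \operatorname{cis}\theta$ in the form $\operatorname{cis}\tfrac12\theta\,[\operatorname{cis}(-\tfrac12\theta) + \operatorname{cis}\tfrac12\theta]$ is often a very
useful device.}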


* Example 5


Show that if triangle ABC is equilateral with the sense of rotation from
A to B to C being anti-clockwise, then $a + b\omega + c\omega^2 = 0$, where $\omega = \operatorname{cis}\tfrac23\pi$;
and conversely.

Since triangle ABC is equilateral, CA is obtained from BC by a pure rotation
through $120^\circ = \tfrac23\pi$ about C.

So $a - c = (c - b)\operatorname{cis}\tfrac23\pi$,
$\Rightarrow\; a - c = \omega(c - b)$, with the notation above,
$\Rightarrow\; a + \omega b - (1 + \omega)c = 0$.

Now $\omega^3 = 1 \;\Rightarrow\; (\omega - 1)(\omega^2 + \omega + 1) = 0$.
But $\omega \ne 1$, so $\omega^2 + \omega + 1 = 0$
(which may also be deduced from the fact that the three points $1$, $\omega$, $\omega^2$
form an equilateral triangle about the origin). Replacing $(1 + \omega)$ by
$-\omega^2$ gives immediately the required result, that $a + \omega b + \omega^2 c = 0$.

Conversely, if it is known that $a + \omega b + \omega^2 c = 0$, then we may write
$a + \omega b + (-1 - \omega)c = 0 \;\Rightarrow\; a - c = \omega(c - b)$,
$\Rightarrow\; a - c = (c - b)\operatorname{cis}\tfrac23\pi$.

Hence CA is obtained from BC by a positive $120^\circ$ rotation, and triangle ABC is
equilateral, in an anti-clockwise sense. The reader may also verify that the
condition $a + \omega^2 b + \omega c = 0$ is necessary and sufficient for the triangle
ABC to be equilateral in the opposite or clockwise sense.


* Example 6


Prove Napoleon's theorem, that if equilateral triangles BCP, CAQ, ABR are
drawn on the sides of any triangle ABC, with P, Q, R outside the triangle,
then the centres of these three triangles form another equilateral triangle.

If triangle PCB is equilateral with anti-clockwise sense,

$$p + \omega c + \omega^2 b = 0.$$

Similarly,

$$q + \omega a + \omega^2 c = 0,$$
$$r + \omega b + \omega^2 a = 0.$$

But if L, M, N are the centres of triangles PCB, QAC, RBA, respectively, then

$$3l = p + c + b,$$
$$3m = q + a + c,$$
$$3n = r + b + a.$$

So
$$3(l + \omega m + \omega^2 n) = (p + c + b) + \omega(q + a + c) + \omega^2(r + b + a)$$
$$= (p + \omega c + \omega^2 b) + \omega(q + \omega a + \omega^2 c) + \omega^2(r + \omega b + \omega^2 a) = 0.$$

Hence triangle LMN is equilateral.
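
Napoleon's theorem can also be checked numerically for a particular triangle. In the sketch below the vertices of ABC are arbitrary sample values, and P, Q, R are obtained from the three conditions $p + \omega c + \omega^2 b = 0$, $q + \omega a + \omega^2 c = 0$, $r + \omega b + \omega^2 a = 0$; the function and variable names are illustrative assumptions.

```python
import cmath
import math

OMEGA = cmath.exp(2j * math.pi / 3)   # omega = cis(2*pi/3)


def napoleon_centres(a: complex, b: complex, c: complex):
    """Centres L, M, N of the equilateral triangles PCB, QAC, RBA."""
    p = -(OMEGA * c + OMEGA ** 2 * b)   # from p + omega*c + omega^2*b = 0
    q = -(OMEGA * a + OMEGA ** 2 * c)   # from q + omega*a + omega^2*c = 0
    r = -(OMEGA * b + OMEGA ** 2 * a)   # from r + omega*b + omega^2*a = 0
    return (p + c + b) / 3, (q + a + c) / 3, (r + b + a) / 3


centre_l, centre_m, centre_n = napoleon_centres(0j, 4 + 0j, 1 + 3j)
sides = [abs(centre_l - centre_m), abs(centre_m - centre_n), abs(centre_n - centre_l)]
residual = abs(centre_l + OMEGA * centre_m + OMEGA ** 2 * centre_n)

print(sides)      # three (numerically) equal lengths
print(residual)   # close to zero
```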


Exercise 10.2d 

1 Simplify: 

(1) 4cis 150° x 5cis 80°; 
cos -- 1 sin 0 

cos @ + isin φ᾽ 


(111) cis@ x cis(—@); 


(iy Lae 37 \7 
iv cos — in—]: 
7 Ϊ 5 1]: 


(vy) (ι -- 1)(1 -- κ.3}) by first expressing each in the form r cis 0; 


(va) (1 + cosa + isin a)(1 + cosa — ising); 
(viz) (1 — cos B + isin B)?; 
(x) cis(—@) + cis 20; 
] 
(x) cos@ + 2sin 8 + —————_;; 
cos 8 — isin 6 
(xi) cisa x cis B + (cis y)*; 
.. 1 — cos@ + isin 6 

(x22) ---------------------.- 

1+ cos? + isin 6 
2 Prove that $|z_1 z_2 z_3 \ldots z_n| = |z_1|\,|z_2|\,|z_3| \ldots |z_n|$
and that $\arg(z_1 z_2 z_3 \ldots z_n) = \arg z_1 + \arg z_2 + \arg z_3 + \cdots + \arg z_n$.

3 If $z = \operatorname{cis}\theta$, express in polar form:


ΖΞ. 3 ες Ζ τὰ ὦ ἐπ ἢ 
πε, (111) ἘΠΕ, (10) 


G@) l+z; (ὦ) 


Z—~1 


4 Given the position of a point z on the Argand diagram, show how to 
construct the positions of: 


32, --2Ζ, [χ΄, ἴΖ, iz). --2ῖζ, 2°, az, χ' Ὁ 21, (Ζ -- 1). 


5 If $c = (1 - k)a + kb$, prove that when $k$ is real, C lies on AB and
divides AB in the ratio $k:(1 - k)$; while when $k$ is complex, the triangle
ABC has $AB:AC = 1:|k|$, and the angle of turn from AB to AC $= \arg k$.


6 Show that the triangles ABC and A’B’C’ are directly similar if and 
only if 
ee a a 


b~c δ -- ε' 


and that this may be expressed as δ᾽ α'(ὁ — c) = Ο; oras 
ab c\=O (see 13.5). 


7 Use the result in the first part of no. 6 above to deduce the condition
for triangle ABC to be equilateral, in the form:

$$a^2 + b^2 + c^2 - bc - ca - ab = 0 \quad (\text{compare Exercise 10.2b, no. 5}).$$

Furthermore, show that this is equivalent to

$$(a + \omega b + \omega^2 c)(a + \omega^2 b + \omega c) = 0 \quad (\text{compare Example 5}).$$


8 Given a triangle ABC, points L, M, N are chosen so that triangles BCL,
CAM, ABN are all directly similar. Prove that triangles ABC and LMN have
the same centroid (use the first part of no. 6).


10.3 Complex conjugates 


We have already noted that the roots of the quadratic equation
$x^2 + 6x + 13 = 0$ are $-3 \pm 2i$, and that these complex numbers appear
on the Argand diagram as mirror images in the real axis.
Complex numbers having this property are called conjugate.

More generally, if
$$z = x + iy = r\operatorname{cis}\theta,$$
we define the complex conjugate of $z$, denoted $z^*$, as
$$z^* = x - iy = r\operatorname{cis}(-\theta),$$
and on the Argand diagram $z$ and $z^*$ will be reflections in the real axis.
We might also write $(x + iy)^* = x - iy$.

Note that:

(a) When $y = 0$ (i.e., the complex number is real) then $z$ is its own
conjugate: real numbers are self-conjugate.

(b) When $|z| = 1$, $z = \operatorname{cis}\theta$, $z^* = \operatorname{cis}(-\theta)$, and in this case conjugates
are reciprocals.

(c) Both the sum and the product of a pair of conjugates are real numbers:

$$z + z^* = (x + iy) + (x - iy) = 2x = 2\operatorname{Re}(z),$$
$$zz^* = (x + iy)(x - iy) = x^2 + y^2 = |z|^2;$$

however, by contrast,

$$z - z^* = 2iy = 2i\operatorname{Im}(z).$$

The reader can quickly verify that for any complex number $z$,
$(z^*)^* = z$, $|z^*| = |z|$ and $\arg z^* = -\arg z$.
Also, $(z_1 + z_2)^* = z_1^* + z_2^*$ and $(z_1 z_2)^* = z_1^* z_2^*$.

The result $zz^* = |z|^2$ has already proved useful when simplifying
expressions, by applying it to make the denominator of a quotient real. For
example,

1} 7 CSP BO a Oo 14; 
SO ΞΞ = te 
24 (254) 2S) 5 

As we observed at the beginning of this section, the roots of the quadratic
equation $z^2 + 6z + 13 = 0$ are a pair of conjugate complex numbers.
Clearly this is bound to happen when solving any quadratic equation
$az^2 + bz + c = 0$, if $a$, $b$, $c$ are real and $b^2 < 4ac$. For

$$z = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a} = -\frac{b}{2a} \pm i\,\frac{\sqrt{4ac - b^2}}{2a},$$

which is a conjugate pair provided that the coefficients $a$, $b$, $c$ are real. (To
show the necessity for this last condition, we can consider the equation
$z^2 + z(-1 + i) + (2 + i) = 0$, whose roots are $i$ and $1 - 2i$,
which are certainly not conjugate.)

Again, the equation
$z^3 - z^2 - z - 15 = 0$ (with real coefficients) has roots $z = 3$, $-1 + 2i$,
$-1 - 2i$, i.e., one real root and a pair of conjugate complex roots.

Furthermore, the equation
$$z^5 - z^4 + 4z^3 - 2z^2 + 8 = 0 \quad (\text{with real coefficients})$$
may be written as
$$(z + 1)(z^2 + 4)(z^2 - 2z + 2) = 0$$
and so has roots $-1$, $\pm 2i$, $1 \pm i$, and again the complex roots occur in
conjugate pairs.

We now prove that, for any polynomial equation with real coefficients,
the complex roots occur in conjugate pairs.
Let the polynomial be

$$f(z) = a_n z^n + a_{n-1} z^{n-1} + \cdots + a_1 z + a_0$$

and suppose that $z$ is a root, so that $f(z) = 0$. We therefore have to prove
that $z^*$ is also a root.

Now
$$f(z^*) = a_n (z^*)^n + a_{n-1} (z^*)^{n-1} + \cdots + a_1 z^* + a_0 = a_n^* (z^*)^n + a_{n-1}^* (z^*)^{n-1} + \cdots + a_1^* z^* + a_0^*$$
(since all the $a_r$ are real and therefore self-conjugate).

But $a_n^* (z^*)^n = a_n^* (z^n)^* = (a_n z^n)^*$, etc.
Hence
$$f(z^*) = (a_n z^n)^* + (a_{n-1} z^{n-1})^* + \cdots + (a_1 z)^* + a_0^* = (a_n z^n + a_{n-1} z^{n-1} + \cdots + a_1 z + a_0)^* = [f(z)]^* = 0.$$

So whenever $z$ is a root, so is $z^*$; and we see that the roots of such
equations with real coefficients always occur in conjugate pairs (or as real
numbers, which are self-conjugate).


Example 
Solve the equation
$$z^3 - 3z^2 + 9z + 13 = 0,$$
given that one root is $2 + 3i$.

As the coefficients are real, another root is $2 - 3i$. So if the third root is $\alpha$,
the sum of the roots of the equation gives

$$\alpha + (2 + 3i) + (2 - 3i) = -(-3) = 3 \;\Rightarrow\; \alpha = -1.$$

Hence the roots are $-1$, $2 \pm 3i$, each of which may be checked in the
original equation.
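
Checking the roots is easily mechanised. The sketch below evaluates the cubic $z^3 - 3z^2 + 9z + 13$ (the cubic with roots $-1$, $2 \pm 3i$) by Horner's rule, and also illustrates the identity $f(z^*) = [f(z)]^*$ for real coefficients at an arbitrary sample point; the function name `poly_eval` is an illustrative assumption.

```python
def poly_eval(coeffs, z):
    """Evaluate a polynomial by Horner's rule; coeffs run from the highest power down."""
    result = 0
    for coefficient in coeffs:
        result = result * z + coefficient
    return result


coeffs = [1, -3, 9, 13]              # z^3 - 3z^2 + 9z + 13
roots = [-1, 2 + 3j, 2 - 3j]

for root in roots:
    print(root, poly_eval(coeffs, root))   # each value should be zero

# For real coefficients, f(z*) = [f(z)]* at any z, root or not.
z = 0.7 - 1.2j
lhs = poly_eval(coeffs, z.conjugate())
rhs = poly_eval(coeffs, z).conjugate()
print(abs(lhs - rhs))                # close to zero
```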


Exercise 10.3 


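1 Prove that:

(i) $(z_1 + z_2)^* = z_1^* + z_2^*$;

(ii) $(z_1 - z_2)^* = z_1^* - z_2^*$;

(iii) $(z_1 z_2)^* = z_1^* z_2^*$;

(iv) $\left(\dfrac{z_1}{z_2}\right)^* = \dfrac{z_1^*}{z_2^*}$.

2 Extend the above results to:
(i) $\left(\sum z_r\right)^* = \sum z_r^*$; (ii) $\left(\prod z_r\right)^* = \prod z_r^*$.†

3 Prove that $z_1^* z_2 = (z_1 z_2^*)^*$, and verify by putting:
$z_1 = r_1\operatorname{cis}\theta_1$; $z_2 = r_2\operatorname{cis}\theta_2$.

4 If $z = r\operatorname{cis}\theta$, show that
$r = \sqrt{zz^*}$, $\theta = \tfrac12(\arg z - \arg z^*)$
and also that $z^2 + (z^*)^2 = 2|z|^2\cos 2\theta$.
Prove that $\arg z = \theta \;\Rightarrow\; z = z^*\operatorname{cis} 2\theta$.
Is the converse true?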
5 Use conjugates instead of moduli and arguments to express: 
(1) ΙΖ -- 1] = ΠΣ]: (ἢ) arg(z — 1) — arg(z+ 1) = ξπ. 


6 If $z_1$ and $z_2$ are complex numbers such that $z_1 + z_2$ and $z_1 z_2$ are both
real, prove that $z_1^* = z_2$.


] 
7 Prove that the area of the triangle OAB is γε (a*b — ab*), and that 
i 


the area of the triangle ABC 1s 


1 Jt ] 


—la b c¢ 1} (see also 13.5). 
a 08 CF 


8 Given that $1 - i$ is a root of
$$2z^3 - 7z^2 + 10z - 6 = 0,$$
find the other roots.


† $\prod$ denoting product, just as $\sum$ denotes sum.

9 If $2 + i$ is a root of
$$2z^3 - 9z^2 + 14z - 5 = 0,$$
find the other roots.

10 Given that $z + n$ $(n \in \mathbb{Z}^+)$ is a factor of $z^3 + 6z^2 + 16z + 16$, find
$n$ and then factorise completely.

11 Solve the equation
$$z^4 - 3z^3 + 4z^2 - 3z + 1 = 0,$$
given that one root is $\tfrac12(1 - \sqrt3\,i)$. Solve also by substituting for
$z + 1/z$.

12 Solve the equation
$$z^4 - 6z^2 + 25 = 0.$$

13 Solve $z^4 - 6z^3 + 23z^2 - 34z + 26 = 0$, given that one root is
$1 + i$.

14 Show that the equation of the circle
$$(x - \alpha)^2 + (y - \beta)^2 = \rho^2$$
can be written as
$$zz^* - p^*z - pz^* + pp^* - \rho^2 = 0,$$
where $p = \alpha + i\beta$.

15 If O is the orthocentre of the triangle formed by $a$, $b$, $c$, prove that:
$$bc^* + b^*c = ca^* + c^*a = ab^* + a^*b.$$

16 Prove that the reflection of A in the line BC is the point A' given by
$$a' = \frac{a^*(b - c) + b^*c - bc^*}{b^* - c^*}.$$


17 In Exercise 10.2a no. 6, we obtained the condition for two triangles 
to be directly similar. What is the condition for opposite similarity? 


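10.4 de Moivre's theorem†


Having established means of obtaining the sum, difference, product, and
quotient of two complex numbers and their interpretation on the Argand
diagram, we now investigate their squares, cubes, square roots, cube roots,
and other powers. Initially we shall restrict ourselves to the rational powers
of complex numbers of unit modulus, i.e., to $(\operatorname{cis}\theta)^n$, where $n \in \mathbb{Q}$.

† After Abraham de Moivre (1667-1754), born in France but resident in England for most
of his life.


Case 1 $n \in \mathbb{Z}^+$

We know that
$$(\operatorname{cis}\theta)^2 = \operatorname{cis}\theta \times \operatorname{cis}\theta = \operatorname{cis}(\theta + \theta) = \operatorname{cis} 2\theta,$$
$$(\operatorname{cis}\theta)^3 = \operatorname{cis}\theta \times (\operatorname{cis}\theta)^2 = \operatorname{cis}\theta \times \operatorname{cis} 2\theta = \operatorname{cis} 3\theta, \text{ and so on.}$$

More generally, $\forall n \in \mathbb{Z}^+$, $(\operatorname{cis}\theta)^n = \operatorname{cis} n\theta$, which can be proved
formally by induction.


Case 2 $n \in \mathbb{Z}^-$

Since $\operatorname{cis}\theta \times \operatorname{cis}(-\theta) = 1$,
$$(\operatorname{cis}\theta)^{-1} = \operatorname{cis}(-\theta).$$

Again,
$$(\operatorname{cis}\theta)^{-2} = \frac{1}{(\operatorname{cis}\theta)^2} = \frac{1}{\operatorname{cis} 2\theta} = \operatorname{cis}(-2\theta),$$
$$(\operatorname{cis}\theta)^{-3} = \frac{1}{(\operatorname{cis}\theta)^3} = \frac{1}{\operatorname{cis} 3\theta} = \operatorname{cis}(-3\theta), \text{ etc.}$$

So, more generally, $(\operatorname{cis}\theta)^n = \operatorname{cis} n\theta$, where $n = -1, -2, -3, \ldots$ (i.e.,
$\forall n \in \mathbb{Z}^-$).


Case 3 $n = 0$

$$(\operatorname{cis}\theta)^0 = 1 = \operatorname{cis} 0.$$

So again, $(\operatorname{cis}\theta)^n = \operatorname{cis} n\theta$ when $n = 0$.


Case 4 $n = p/q$, a rational fraction, with $p, q \in \mathbb{Z}$

As $q \in \mathbb{Z}$,
$$\left(\operatorname{cis}\frac{p\theta}{q}\right)^q = \operatorname{cis}\left(\frac{p\theta}{q} \times q\right) \quad (\text{using the result above, } p \text{ and } q \text{ being integers})$$
$$= \operatorname{cis} p\theta = (\operatorname{cis}\theta)^p.$$

So $\operatorname{cis}\dfrac{p\theta}{q}$ is a $q$th root of $(\operatorname{cis}\theta)^p$, i.e., a value of $(\operatorname{cis}\theta)^{p/q}$.

Summarising these results, we have de Moivre's theorem, that

if $n \in \mathbb{Z}$, $(\operatorname{cis}\theta)^n = \operatorname{cis} n\theta$;

and if $n \in \mathbb{Q}$, $\operatorname{cis} n\theta$ is one of the values of $(\operatorname{cis}\theta)^n$.


It is necessary to say 'one of the values of', for we shall see later that
$(\operatorname{cis}\theta)^{p/q}$ has in fact $q$ distinct values. At the moment, we note that, since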


(cis 42)? = cis ἐπ and (cis 3x)? = cis ὅπ = cis 91, 


then, (cis $7)‘/? = either cis ἐπ or cis ὅπ, 
there being exactly two square roots of a complex number, which (not
unexpectedly) are a pair of negatives. The fact that there are three cube
roots may be more surprising. However, it is easily verified that $\operatorname{cis} 120^\circ$
and $\operatorname{cis} 240^\circ$ are both cube roots of 1. Moreover, you may verify that
$2 - i$, $-2 + i$, $1 + 2i$, and $-1 - 2i$ are all fourth roots of $-7 - 24i$.

De Moivre's theorem tells us that an effect of raising a complex number
to the power $n$ is to multiply its argument by $n$. This is a result of
far-reaching importance, leading to powerful and elegant solutions of many
problems, and we now proceed to consider some of its applications.
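
Both statements are easy to test numerically. The following Python sketch checks $(\operatorname{cis}\theta)^n = \operatorname{cis} n\theta$ for one arbitrary choice of $\theta$ and integer $n$, and raises each of $2 - i$, $-2 + i$, $1 + 2i$, $-1 - 2i$ to the fourth power; the helper name `cis` and the sample values are illustrative assumptions.

```python
import cmath


def cis(theta: float) -> complex:
    """cis(theta) = cos(theta) + i sin(theta), theta in radians."""
    return cmath.exp(1j * theta)


theta, n = 0.4, 7                       # arbitrary sample values
lhs = cis(theta) ** n                   # (cis theta)^n
rhs = cis(n * theta)                    # cis(n theta)
print(abs(lhs - rhs))                   # close to zero

# Fourth powers of the four numbers quoted as fourth roots of -7 - 24i.
fourth_powers = [w ** 4 for w in (2 - 1j, -2 + 1j, 1 + 2j, -1 - 2j)]
print(fourth_powers)                    # each equal to -7 - 24i
```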


Example 1 


Use de Moivre's theorem to find formulae for $\cos 4\theta$ and $\sin 4\theta$ in terms of
$\cos\theta$ and $\sin\theta$.

(Here, and in subsequent examples, we shall use the abbreviation $c = \cos\theta$,
$s = \sin\theta$.)

$$\cos 4\theta + i\sin 4\theta = (c + is)^4 = c^4 + 4c^3(is) + 6c^2(is)^2 + 4c(is)^3 + (is)^4 = (c^4 - 6c^2s^2 + s^4) + i(4c^3s - 4cs^3).$$

Equating real and imaginary parts,

$$\cos 4\theta = c^4 - 6c^2s^2 + s^4 = c^4 - 6c^2(1 - c^2) + (1 - c^2)^2 = 8c^4 - 8c^2 + 1,$$
$$\sin 4\theta = 4c^3s - 4cs^3 = 4cs(c^2 - s^2).$$

The method may be generalised to obtain $\cos n\theta$ and $\sin n\theta$ in terms of
$c$ and $s$, starting from $(c + is)^n = \cos n\theta + i\sin n\theta$. Using the binomial
theorem and equating real and imaginary parts, we get

$$\cos n\theta = c^n - {}^nC_2\,c^{n-2}s^2 + {}^nC_4\,c^{n-4}s^4 - \cdots$$
$$\sin n\theta = nc^{n-1}s - {}^nC_3\,c^{n-3}s^3 + {}^nC_5\,c^{n-5}s^5 - \cdots$$

It follows that

$$\tan n\theta = \frac{\sin n\theta}{\cos n\theta} = \frac{nc^{n-1}s - {}^nC_3\,c^{n-3}s^3 + \cdots}{c^n - {}^nC_2\,c^{n-2}s^2 + \cdots} = \frac{nt - {}^nC_3\,t^3 + {}^nC_5\,t^5 - \cdots}{1 - {}^nC_2\,t^2 + {}^nC_4\,t^4 - \cdots}$$

(dividing numerator and denominator by $c^n$, and using $t = \tan\theta$).

In particular, for example,

$$\tan 5\theta = \frac{5t - 10t^3 + t^5}{1 - 10t^2 + 5t^4}; \qquad \tan 4\theta = \frac{4t - 4t^3}{1 - 6t^2 + t^4}, \text{ etc.}$$
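
The general expansion of $(c + is)^n$ into real and imaginary parts can be sketched in Python and compared with the library values of $\cos n\theta$ and $\sin n\theta$. The function name `multiple_angle` and the sample angle are illustrative assumptions.

```python
import math
from math import comb


def multiple_angle(n: int, theta: float) -> tuple[float, float]:
    """Return (cos n*theta, sin n*theta) from the expansion of (c + i s)^n."""
    c, s = math.cos(theta), math.sin(theta)
    # Even k contribute to the real part, odd k to the imaginary part;
    # in both cases the sign alternates as (-1)^(k // 2).
    cos_n = sum((-1) ** (k // 2) * comb(n, k) * c ** (n - k) * s ** k
                for k in range(0, n + 1, 2))
    sin_n = sum((-1) ** (k // 2) * comb(n, k) * c ** (n - k) * s ** k
                for k in range(1, n + 1, 2))
    return cos_n, sin_n


theta = 0.3
cos4, sin4 = multiple_angle(4, theta)
c = math.cos(theta)
print(cos4, 8 * c ** 4 - 8 * c ** 2 + 1, math.cos(4 * theta))   # three matching values
```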
Example 2 


Use de Moivre's theorem to investigate the cube roots of $i$.

$$i = \operatorname{cis}\tfrac12\pi \;\Rightarrow\; i^{1/3} = (\operatorname{cis}\tfrac12\pi)^{1/3},$$

so one of the values is
$$\operatorname{cis}\tfrac16\pi = \cos\tfrac16\pi + i\sin\tfrac16\pi = \tfrac12(\sqrt3 + i).$$

However, we may also regard $i$ as having argument $5\pi/2$, $9\pi/2$, ..., and
when de Moivre's theorem is applied we obtain $i^{1/3} = (\operatorname{cis} 5\pi/2)^{1/3} =
\operatorname{cis} 5\pi/6$, and again $i^{1/3} = (\operatorname{cis} 9\pi/2)^{1/3} = \operatorname{cis} 3\pi/2$. The addition of further
revolutions to the argument of $i$ cannot produce any different values of
$i^{1/3}$; for example, $(\operatorname{cis} 13\pi/2)^{1/3} = \operatorname{cis} 13\pi/6 = \operatorname{cis}\pi/6$, which has already
been obtained. Again, if we regard $i$ as having argument $-3\pi/2$, this
gives $[\operatorname{cis}(-3\pi/2)]^{1/3} = \operatorname{cis}(-\pi/2) = \operatorname{cis} 3\pi/2$, also already obtained. Thus
the three cube roots of $i$ will be seen to have arguments separated by
multiples of $2\pi/3$, and so appear on the Argand diagram as the vertices of
an equilateral triangle, at the points $P_1$, $P_2$, $P_3$.


Example 3 
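Find the values of $(-1 + i)^{3/5}$.

Now $-1 + i = \sqrt2\operatorname{cis} 135^\circ$.
So, by de Moivre's theorem, one of the values of $(-1 + i)^{3/5}$ is
$$(\sqrt2\operatorname{cis} 135^\circ)^{3/5} = 2^{0.3}\operatorname{cis} 81^\circ.$$

However, any number of revolutions may be added to the argument of
the original number, which may therefore be written $\sqrt2\operatorname{cis}(135^\circ + 360^\circ n)$,
where $n$ is any integer.

Thus
$$(-1 + i)^{3/5} = [\sqrt2\operatorname{cis}(135^\circ + 360^\circ n)]^{3/5} = 2^{0.3}\operatorname{cis}(81^\circ + 216^\circ n),$$
i.e. $2^{0.3}\operatorname{cis} 81^\circ$, $2^{0.3}\operatorname{cis} 297^\circ$, $2^{0.3}\operatorname{cis} 513^\circ$, $2^{0.3}\operatorname{cis} 729^\circ$, $2^{0.3}\operatorname{cis} 945^\circ$, etc.;
so, reducing the arguments modulo $360^\circ$, we see that

$$2^{0.3}\operatorname{cis} 81^\circ,\quad 2^{0.3}\operatorname{cis} 297^\circ,\quad 2^{0.3}\operatorname{cis} 153^\circ,\quad 2^{0.3}\operatorname{cis} 9^\circ,\quad 2^{0.3}\operatorname{cis} 225^\circ$$

are five distinct values.

These are shown in the figure, where the arrows show the route followed
by adding $216^\circ$ repeatedly to the argument. It is also clear that they form a
regular pentagon, and that precisely five values exist.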
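
The five values of $(-1 + i)^{3/5}$ can be generated directly by adding whole revolutions to the argument before applying de Moivre's theorem. The following is a minimal Python sketch; the function name `rational_power_values` is an illustrative assumption, and the fraction $p/q$ is assumed to be in lowest terms.

```python
import cmath
import math


def rational_power_values(z: complex, p: int, q: int) -> list[complex]:
    """All q values of z^(p/q) for non-zero z, with p/q in lowest terms and q > 0."""
    r, theta = cmath.polar(z)
    modulus = r ** (p / q)
    # Add k whole revolutions to the argument of z before multiplying by p/q.
    return [cmath.rect(modulus, p * (theta + 2 * math.pi * k) / q)
            for k in range(q)]


values = rational_power_values(-1 + 1j, 3, 5)
for w in values:
    # Modulus, and argument reduced to the range [0, 360) degrees.
    print(round(abs(w), 4), round(math.degrees(cmath.phase(w)) % 360, 1))
```

Each value `w` satisfies $w^5 = (-1 + i)^3$, which gives an independent check on the list.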


At this stage it is interesting to reflect on what has been achieved. When $i$
was introduced to represent $\sqrt{-1}$, it might have been expected that
further inventions would be required in order to represent $\sqrt{i}$, $\sqrt[3]{-i}$ and
$\sqrt[5]{(1 - i)}$, and that a whole range of 'hyper-complex' numbers would
have to be developed. But de Moivre's theorem immediately shows that
this will not be necessary, and that the simple invention of $i$ as $\sqrt{-1}$ is
entirely sufficient for dealing with all such complications.


Exercise 10.4a 


1 Express in the form r cis 8 where the argument has its principal value: 
(1) (eis 72}; (72) (εἰς 100°)~°; 

(11) (28 + 967)?; (iv) (cos 80° — isin 80°)*”; 

(ἡ. 5: 7 5; (vi) (1 + })}9 (two methods) ; 

(vit) (./3 — ἡ)"; (vii) δ + ἡ); 

(Com ψα -- 1.,Δ(25); (x) (sin -- icos θ)"; 

(xz) (1 + tan 0)"; (x22 ) 


(cos 5a + isin 5a)°(cos 3a — isin 3a)’ 


ee 


2 Find τ' 1: (1) by letting its value be x + zy; (11) by expressing in polar 
form. Find also values of τὴ 2 and of < (—1). 


) 
(cos ἃ - 2sin a)*(cos 2a — isin 2a)~? 
( 


3 Obtain $\sin 3\theta$ in terms of $\sin\theta$, and $\cos 3\theta$ in terms of $\cos\theta$ by using
de Moivre's theorem. Hence show that if $\sin 3\theta + \cos 3\theta = 0$, then either
$\tan\theta = 1$ or $\sin 2\theta = -\tfrac12$.


4 Find the principal value of the arguments of (cis 50°)”, (cis 122°)’, 
and [cis (— 166°)]°. Can you find two other complex numbers which give 
the same result? 
5 Find as many as possible values of (1 + 7)'/°. 


6 (7) Solve z* — 1 = 0, expressing the three roots in polar form. 
(11) By writing —1 as cis 2, obtain the roots of ζῇ + 1 = 0. 
(11) Find all the values of (cis 1509), 


7 Find formulae for: (i) $\cos 6\theta$ in terms of $\cos\theta$; (ii) $\sin 7\theta/\sin\theta$ in terms
of $\cos\theta$; (iii) $\tan 5\theta$ in terms of $\tan\theta$; (iv) $\sin 6\theta/\sin\theta$ in terms of $\cos\theta$.


8 (7) If 2 Ὁ χ΄ = 2cos0, show that χ' + z~™" = ὃ cos nO, and find 


(11) Ifz + z~* = —1, find the values of z" + z~" for n = 2, 3,4,.... 
9 If $\cos\alpha + \cos\beta + \cos\gamma = 0$ and $\sin\alpha + \sin\beta + \sin\gamma = 0$, prove
that $\alpha$, $\beta$ and $\gamma$ are separated by $\tfrac23\pi$. Generalise in the case of the equations

$$\sum_{1}^{n}\cos\alpha_r = 0, \qquad \sum_{1}^{n}\sin\alpha_r = 0.$$


10 Prove that $(1 + \cos\theta - i\sin\theta)^n = 2^n\cos^n\tfrac12\theta\,(\cos\tfrac12 n\theta - i\sin\tfrac12 n\theta)$.
Hence write down the value of $(1 + \cos\theta + i\sin\theta)^n$ and simplify the sum
of these two expressions.


11 Using the expansion of $\tan 5\theta$ from no. 7 above, show that the roots of
$$t^5 + 5t^4 - 10t^3 - 10t^2 + 5t + 1 = 0$$
are $\tan\dfrac{(4n - 1)\pi}{20}$ $(n \in \mathbb{Z})$, and obtain five distinct roots.

Use the expansion of $\tan 6\theta$ to evaluate $\tan\pi/12$.


12 Use the expansion of $\tan 5\theta$ in terms of $t$ $(= \tan\theta)$ obtained in no. 7
to solve the equation

$$t^5 - 10t^4 - 10t^3 + 20t^2 + 5t - 2 = 0.$$

13 Solve the equation sin 50 = sin? 0. 

14 Using the expansion of $\cos 5\theta$, deduce that $\sin 18^\circ = \tfrac14(\sqrt5 - 1)$.

15 Prove that

$$\sin 7\theta = 7c^6s - 35c^4s^3 + 21c^2s^5 - s^7, \quad\text{where } c = \cos\theta,\ s = \sin\theta.$$

Hence solve the equation

$$7x^3 - 35x^2 + 21x - 1 = 0,$$

giving three distinct roots.


The nth roots of a complex number 


The above examples illustrate how, in general, the nth roots of a complex
number may be found. Writing the number in the form $z = r\operatorname{cis}\theta$, we
have $(r\operatorname{cis}\theta)^{1/n} = r^{1/n}\operatorname{cis}(\theta/n)$ as one of the roots. To obtain the other
roots, we write $z = r\operatorname{cis}(\theta + 2k\pi)$ $(k \in \mathbb{Z})$. Then $z^{1/n} = r^{1/n}\operatorname{cis}\dfrac{\theta + 2\pi k}{n}$.
It is evident that n different values are obtained by k taking the values
$0, 1, 2, 3, \ldots, n - 1$, and that this exhausts the distinct values: when k
takes the value n (i.e. when $2n\pi$ is added to the argument of z) the argument
of $z^{1/n}$ will have been augmented by $2\pi$, so repeating the value corresponding
to $k = 0$. So the nth roots lie in the Argand diagram on the circle
of radius $r^{1/n}$, separated by a consecutive angular distance of $2\pi/n$, and form
a regular n-gon.
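In particular, the nth roots of unity are given by

$$(\operatorname{cis} 2\pi k)^{1/n} = \operatorname{cis}\frac{2\pi k}{n} \quad (k = 0, 1, \ldots, n - 1),$$

and these form a regular n-gon in the Argand diagram. If we let $\omega = \operatorname{cis}\dfrac{2\pi}{n}$,
then the n roots may be denoted $1, \omega, \omega^2, \omega^3, \ldots, \omega^{n-1}$, with $\omega^n = 1$. It is
clear that $1 + \omega + \omega^2 + \omega^3 + \cdots + \omega^{n-1} = 0$; and furthermore, since
$1, \omega, \omega^2, \omega^3, \ldots, \omega^{n-1}$ are the roots of $z^n - 1 = 0$,

$$z^n - 1 = (z - 1)(z - \omega)(z - \omega^2)(z - \omega^3)\cdots(z - \omega^{n-1}).$$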
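The recipe for the nth roots can be tried out numerically. The following is only a minimal sketch in Python, using the standard `cmath` module; the function name `nth_roots` is our own choice for illustration and does not come from the text.

```python
import cmath
import math


def nth_roots(z: complex, n: int) -> list[complex]:
    """Return the n values r^(1/n) cis((theta + 2*pi*k)/n), k = 0, ..., n-1."""
    r, theta = cmath.polar(z)  # z = r cis(theta)
    return [
        cmath.rect(r ** (1.0 / n), (theta + 2 * math.pi * k) / n)
        for k in range(n)
    ]


# The six roots of z^6 + 1 = 0 are the sixth roots of -1.
for w in nth_roots(-1, 6):
    print(f"{w:.4f}")

# The cube roots of unity 1, w, w^2 add up to zero.
print(abs(sum(nth_roots(1, 3))) < 1e-12)
```

Each root printed has modulus 1, and successive roots differ in argument by $2\pi/6$, which is the regular-polygon pattern described above.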


The 11th roots of 1 


The particular case n = 3 is important, and we have 1, $\operatorname{cis}\tfrac23\pi$ and
$\operatorname{cis}(-\tfrac23\pi)$ as the cube roots of unity. It has already been seen how the cube
roots of unity can be applied to the solution of a geometrical problem such 
as Napoleon’s theorem, and in Chapter 13 we shall also see how they may 
be used to evaluate a determinant. Meanwhile we provide some simple 
examples on the nth roots of a number: 


Example 4
Solve the equation $z^6 + 1 = 0$.

$$z^6 = -1 = \operatorname{cis}(2\pi k + \pi).$$

So, by de Moivre's theorem,

$$z = [\operatorname{cis}(2\pi k + \pi)]^{1/6} = \operatorname{cis}\frac{\pi(2k + 1)}{6}.$$

Taking $k = 0, 1, 2, 3, 4, 5$ we obtain $z = \operatorname{cis}\pi/6$, $\operatorname{cis}\pi/2$, $\operatorname{cis} 5\pi/6$, $\operatorname{cis} 7\pi/6$,
$\operatorname{cis} 3\pi/2$, $\operatorname{cis} 11\pi/6$.


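Example 5
Solve the equation $(z - 1)^5 + (z + 1)^5 = 0$.

$$\left(\frac{z - 1}{z + 1}\right)^5 = -1 = \operatorname{cis}(2k + 1)\pi.$$

Writing $\alpha = \dfrac{(2k + 1)\pi}{5}$ $(k = 0, 1, 2, 3, 4)$, we have $\dfrac{z - 1}{z + 1} = \operatorname{cis}\alpha$, so that

$$z = \frac{1 + \operatorname{cis}\alpha}{1 - \operatorname{cis}\alpha}
= \frac{\operatorname{cis}\tfrac12\alpha\,[\operatorname{cis}\tfrac12\alpha + \operatorname{cis}(-\tfrac12\alpha)]}{-\operatorname{cis}\tfrac12\alpha\,[\operatorname{cis}\tfrac12\alpha - \operatorname{cis}(-\tfrac12\alpha)]}
= \frac{2\cos\tfrac12\alpha}{-2i\sin\tfrac12\alpha}
= i\cot\tfrac12\alpha = i\cot\frac{(2k + 1)\pi}{10}.$$

So the roots are (putting $k = 0, 1, 2, 3, 4$):

$$z = i\cot\frac{\pi}{10},\ i\cot\frac{3\pi}{10},\ i\cot\frac{5\pi}{10},\ i\cot\frac{7\pi}{10},\ i\cot\frac{9\pi}{10}$$

$$\phantom{z} = i\cot\frac{\pi}{10},\ i\cot\frac{3\pi}{10},\ 0,\ -i\cot\frac{3\pi}{10},\ -i\cot\frac{\pi}{10}$$

$$\phantom{z} = 0,\ \pm i\cot\frac{\pi}{10},\ \pm i\cot\frac{3\pi}{10}.$$

(We expect an equation of degree 5 to have 5 roots. Note that there is one
real root, together with two pairs of conjugate complex roots.)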
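*Example 6
Factorise $x^{2n} - 2x^n\cos n\alpha + 1$.

If $x^{2n} - 2x^n\cos n\alpha + 1 = 0$,

then $(x^n - \cos n\alpha)^2 + \sin^2 n\alpha = 0$,

giving $x^n = \cos n\alpha \pm i\sin n\alpha = \operatorname{cis}[\pm(n\alpha + 2k\pi)]$

$$\Rightarrow\quad x = \operatorname{cis}\left[\pm\left(\alpha + \frac{2k\pi}{n}\right)\right].$$

For $k = 0, 1, 2, \ldots, n - 1$, we obtain 2n roots, and these also occur in
conjugate complex pairs. So the expression may now be expressed in
factors:

$$x^{2n} - 2x^n\cos n\alpha + 1 = \prod_{k=0}^{n-1}\left[x - \operatorname{cis}\left(\alpha + \frac{2k\pi}{n}\right)\right]\left[x - \operatorname{cis}\left(-\alpha - \frac{2k\pi}{n}\right)\right]
= \prod_{k=0}^{n-1}\left[x^2 - 2x\cos\left(\alpha + \frac{2k\pi}{n}\right) + 1\right],$$

combining the conjugate pairs of factors. Thus the expression has been
converted to the product of n real quadratic factors, and once again we have
achieved a result in real algebra by the use of complex numbers.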


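Further examples on the use of de Moivre's theorem
Example 7

Express $\cos^4\theta$ and $\sin^5\theta$ in terms of the sines and cosines of multiple
angles.

We note that $z = \operatorname{cis}\theta \Rightarrow z^{-1} = \operatorname{cis}(-\theta)$.

So $z + z^{-1} = 2\cos\theta$ and $z - z^{-1} = 2i\sin\theta$.
Also $z^n = \operatorname{cis} n\theta$ and $z^{-n} = \operatorname{cis}(-n\theta)$.

So $z^n + z^{-n} = 2\cos n\theta$; $z^n - z^{-n} = 2i\sin n\theta$.

Hence
$$\left(z + \frac1z\right)^4 = z^4 + 4z^2 + 6 + 4z^{-2} + z^{-4}$$
$$\Rightarrow\quad (2\cos\theta)^4 = (z^4 + z^{-4}) + 4(z^2 + z^{-2}) + 6 = 2\cos 4\theta + 4 \times 2\cos 2\theta + 6$$
$$\Rightarrow\quad 16\cos^4\theta = 2\cos 4\theta + 8\cos 2\theta + 6$$
$$\Rightarrow\quad \cos^4\theta = \tfrac18(\cos 4\theta + 4\cos 2\theta + 3).$$

Also
$$\left(z - \frac1z\right)^5 = (z^5 - z^{-5}) - 5(z^3 - z^{-3}) + 10(z - z^{-1})$$
$$\Rightarrow\quad (2i\sin\theta)^5 = 2i\sin 5\theta - 5 \times 2i\sin 3\theta + 10 \times 2i\sin\theta$$
$$\Rightarrow\quad 32\sin^5\theta = 2\sin 5\theta - 10\sin 3\theta + 20\sin\theta \quad (\text{since } i^4 = 1)$$
$$\Rightarrow\quad \sin^5\theta = \tfrac1{16}(\sin 5\theta - 5\sin 3\theta + 10\sin\theta).$$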
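The two multiple-angle formulae of Example 7 are easily spot-checked on a machine. The sketch below, in Python, simply compares each side at a few sample angles; the function names are ours, chosen for illustration.

```python
import math


def cos4_by_multiple_angles(theta: float) -> float:
    """Right-hand side of cos^4(theta) = (cos 4t + 4 cos 2t + 3) / 8."""
    return (math.cos(4 * theta) + 4 * math.cos(2 * theta) + 3) / 8


def sin5_by_multiple_angles(theta: float) -> float:
    """Right-hand side of sin^5(theta) = (sin 5t - 5 sin 3t + 10 sin t) / 16."""
    return (math.sin(5 * theta) - 5 * math.sin(3 * theta) + 10 * math.sin(theta)) / 16


for theta in (0.0, 0.4, 1.3, 2.9):
    print(
        theta,
        math.isclose(math.cos(theta) ** 4, cos4_by_multiple_angles(theta), abs_tol=1e-12),
        math.isclose(math.sin(theta) ** 5, sin5_by_multiple_angles(theta), abs_tol=1e-12),
    )
```

A numerical agreement of this kind is of course no proof, but it is a useful guard against slips in the binomial coefficients.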


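Example 8
Find the sum of the series
$$S = \sin\alpha + \sin(\alpha + \beta) + \sin(\alpha + 2\beta) + \cdots + \sin[\alpha + (n - 1)\beta].$$
Letting
$$C = \cos\alpha + \cos(\alpha + \beta) + \cos(\alpha + 2\beta) + \cdots + \cos[\alpha + (n - 1)\beta],$$
we have
$$C + iS = \operatorname{cis}\alpha + \operatorname{cis}(\alpha + \beta) + \operatorname{cis}(\alpha + 2\beta) + \cdots + \operatorname{cis}[\alpha + (n - 1)\beta],$$
which may be written
$$C + iS = \operatorname{cis}\alpha + \operatorname{cis}\alpha\operatorname{cis}\beta + \operatorname{cis}\alpha\operatorname{cis} 2\beta + \cdots + \operatorname{cis}\alpha\operatorname{cis}(n - 1)\beta$$
$$= \operatorname{cis}\alpha\,[1 + \operatorname{cis}\beta + (\operatorname{cis}\beta)^2 + \cdots + (\operatorname{cis}\beta)^{n-1}]$$
$$= \frac{\operatorname{cis}\alpha\,[1 - (\operatorname{cis}\beta)^n]}{1 - \operatorname{cis}\beta}, \quad\text{being the sum of a g.p.,}$$
$$= \frac{\operatorname{cis}\alpha\,(1 - \operatorname{cis} n\beta)}{1 - \operatorname{cis}\beta}, \quad\text{using de Moivre's theorem.}$$

Using the result of p. 122, this gives
$$C + iS = \operatorname{cis}[\alpha + \tfrac12(n - 1)\beta]\,\frac{\sin\tfrac12 n\beta}{\sin\tfrac12\beta}.$$

Finally, equating real and imaginary parts,
$$C = \cos[\alpha + \tfrac12(n - 1)\beta]\,\frac{\sin\tfrac12 n\beta}{\sin\tfrac12\beta}; \qquad S = \sin[\alpha + \tfrac12(n - 1)\beta]\,\frac{\sin\tfrac12 n\beta}{\sin\tfrac12\beta}.$$

In the special case when $\alpha = \beta = \theta$, we get
$$\cos\theta + \cos 2\theta + \cdots + \cos n\theta = \frac{\sin\tfrac12 n\theta\,\cos\tfrac12(n + 1)\theta}{\sin\tfrac12\theta}.$$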
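The closed forms of Example 8 can be compared with a term-by-term sum. The following Python sketch does this for the sine series; the function names are our own, and the closed form assumes $\sin\tfrac12\beta \neq 0$.

```python
import math


def sine_sum_direct(alpha: float, beta: float, n: int) -> float:
    """Add sin(alpha + k*beta) for k = 0, ..., n-1 term by term."""
    return sum(math.sin(alpha + k * beta) for k in range(n))


def sine_sum_closed(alpha: float, beta: float, n: int) -> float:
    """Closed form S = sin[alpha + (n-1)beta/2] sin(n beta/2) / sin(beta/2).

    Assumes sin(beta/2) is not zero.
    """
    return (
        math.sin(alpha + 0.5 * (n - 1) * beta)
        * math.sin(0.5 * n * beta)
        / math.sin(0.5 * beta)
    )


print(sine_sum_direct(0.3, 0.7, 9), sine_sum_closed(0.3, 0.7, 9))
```

The two printed values should agree to within rounding error.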


Exercise 10.4b 


1 Show that $(1 + 3i)^4 = 28 - 96i$, and find the other fourth roots of
$28 - 96i$.

Find: 

the cube roots of § (cos 60° + 7 sin 60°) ; 
the fifth roots of 4. 2 οἰ5(-- 80); 

the eighth roots of cos 160° — 7 sin 160°; 
the ninth roots of 7; 

the four values of (1 — J/3 ae 

the sixth roots of -- /3 : 

(ays 

four values of z to satisfy z* + 1 = i. 



3 Solve the equations: 
(ἡ) 2+8=0; 
(1) 2+erP+¢z274¢74+274+2z24+1=0; 
za—-ze7+2z*—z+1=0; 
z>—1=0; 
oo 92 125 =, 
4 Solve the equations: 
Gy (22 -- 1)" ΞΞ (Ζ -- 2)"; (i) (Ζ - i)* = 81z*; 
(i) z2° — 277° +2=0 (1υ) χ᾽ -- χ᾽ -- Ζ -1-Ξ- 0 


In Exercises 5 to 9, take $\omega$ to be a complex cube root of 1.


5 Prove that: 

(@) (l+q@)(1 + 7) =1; (1) (l+o*)?  -; 

(111) Simplify (1 + @)(1 + 2@)(1 + 3@)(1 + 5ω). 

6 Prove by reference to the Argand diagram that $1 + 2\omega$ is purely
imaginary.

7 Form the quadratic equations whose roots are:

(i) $3 + \omega$, $3 + \omega^2$; (ii) $2\omega - \omega^2$, $2\omega^2 - \omega$.

8 Prove that $1 + \omega^n + \omega^{2n}$ takes the values 3 or 0 according as n is or
is not divisible by 3.

9 From the identity

$$a^2 + b^2 + c^2 - bc - ca - ab = (a + b\omega + c\omega^2)(a + b\omega^2 + c\omega) \quad (a, b, c \in \mathbb{C}),$$

show that if $\sum a^2 = \sum bc$, then either $a - c = \omega(c - b)$ or else
$a - c = \omega^2(c - b)$, and interpret in the Argand diagram.

10 Express the following in terms of multiple angles: 

(2) cos? 0; (11) sin® 0; (11) sin’ 0; 

(iv) $\sin^p\theta\cos^q\theta$ in the cases where $(p, q) = (3, 4), (4, 3), (4, 6), (3, 7)$, etc.


11 Use the results of no. 10 to find the integrals 
| sin 6d@ and | sin* @ cos® 6 dé. 
0 
12 Prove that
$$1 + \cos\alpha + \cos 2\alpha + \cdots + \cos(n - 1)\alpha = \frac{\sin\tfrac12 n\alpha\,\cos\tfrac12(n - 1)\alpha}{\sin\tfrac12\alpha}.$$
1 as i ol 
cos ἃ + cos 2a -Ἐ - - - -Ε cos no 
Obtain formulae for $\cos\alpha - \cos 3\alpha + \cos 5\alpha - \cdots \pm \cos(2n - 1)\alpha$
and $\sin\alpha - \sin 3\alpha + \sin 5\alpha - \cdots \pm \sin(2n - 1)\alpha$.

13 Use the result of no. 12 to find a formula for the sum of the series
$\sin\alpha + 2\sin 2\alpha + 3\sin 3\alpha + \cdots + n\sin n\alpha$.

14 Find the sum to infinity of the series

$$S = \cos\theta\sin\theta + \cos^2\theta\sin 2\theta + \cos^3\theta\sin 3\theta + \cdots$$
(consider the g.p. $\sum \cos^n\theta\operatorname{cis} n\theta$).
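10.5 Euler's equation: $\operatorname{cis}\theta = e^{i\theta}$


Let us now consider the function $y = \cos x + i\sin x$. Continuing to
treat $i$ like any other number, we have, by differentiation:

$$\frac{dy}{dx} = -\sin x + i\cos x = i(\cos x + i\sin x).$$

Hence
$$\frac{dy}{dx} = iy \quad\Rightarrow\quad i\,\frac{dx}{dy} = \frac1y \quad\Rightarrow\quad ix = \ln y + c.$$

But $y = 1$ when $x = 0$, so that $c = 0$. Hence $\ln y = ix$; that is,
$\cos x + i\sin x = e^{ix}$.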


It may be objected that we have nowhere defined the meaning of a number
such as $e^z$ when z is complex; but the reader should not be deterred by such
inhibitions! Indeed, the above paragraph may be regarded as providing,
if not a definition, at least a reasonable exposition of the meaning of
$e^{ix}$, making it consistent with the familiar processes of mathematics.
For example, the rule for the multiplication of complex numbers,
$\operatorname{cis}\theta_1 \times \operatorname{cis}\theta_2 = \operatorname{cis}(\theta_1 + \theta_2)$, may now be restated: $e^{i\theta_1} \times e^{i\theta_2} = e^{i(\theta_1 + \theta_2)}$,
which is just what we should expect in a well-behaved system, 'imaginary'
powers of $e$ obeying the same rules as real powers. Indeed, the rule for
addition of arguments (10.2) when multiplying complex numbers is now
seen to be no more than a disguised version of the rule for addition of
indices, or logarithms!

Again, de Moivre's theorem may now be rewritten in the perhaps more
familiar form $(e^{i\alpha})^n = e^{in\alpha}$, and, as further support for the truth of the
statement $e^{ix} = \operatorname{cis} x$, it will be remembered that we obtained the following
infinite series by using Maclaurin's expansion:

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots; \qquad \sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots.$$

Then
$$\operatorname{cis} x = \cos x + i\sin x = \left(1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots\right) + i\left(x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots\right)$$
$$= 1 + ix - \frac{x^2}{2!} - \frac{ix^3}{3!} + \frac{x^4}{4!} + \cdots$$
$$= 1 + (ix) + \frac{(ix)^2}{2!} + \frac{(ix)^3}{3!} + \cdots,$$

being the infinite series for $e^z$, with z substituted by $ix$. This further
encourages us to accept the validity of the expansion of the exponential
function as an infinite series in cases when the exponent is imaginary.

The above is neither a complete nor a conclusive argument: a great deal of
analysis, involving the use of the exponential series $e^z = 1 + z + z^2/2! + \cdots$,
is needed to make the definition of $e^{ix}$ watertight. Meanwhile, however, we
shall proceed on the basis that the above broad outline makes sense.

The most remarkable result may be obtained by putting $x = \pi$:

$$e^{i\pi} = \operatorname{cis}\pi = -1 + 0i = -1,$$

so we have derived the equation due to the great mathematician Euler:

$$e^{i\pi} + 1 = 0,$$

which unites in one spectacular relationship
the five most important numbers in mathematics: 0, 1, $\pi$, $e$ and $i$.


Sense or nonsense? It must be true! Certum est quia impossibile est:
Nothing so incredible could be an invention!
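The sceptical reader may care to watch the exponential series converge when the exponent is imaginary. The following Python sketch sums the first few terms of $1 + z + z^2/2! + \cdots$ with $z = i\pi$; the function name `exp_series` and the choice of 30 terms are ours, made only for illustration.

```python
import cmath
import math


def exp_series(z: complex, terms: int = 30) -> complex:
    """Partial sum 1 + z + z^2/2! + ... of the exponential series."""
    total = 0j
    term = 1 + 0j  # current term z^k / k!, starting with k = 0
    for k in range(terms):
        total += term
        term *= z / (k + 1)
    return total


print(exp_series(1j * math.pi))   # very nearly -1
print(cmath.exp(1j * math.pi))    # the library value, for comparison
```

Both values differ from $-1$ only by rounding error, in agreement with Euler's equation.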


Example 1 


Integrals such as $\int e^{ax}\cos bx\,dx$ and $\int e^{ax}\sin bx\,dx$ are usually found by two
applications of integration by parts, a tedious procedure. We now look at
a way of avoiding this tedium by using the exponential form of the complex
number, working the method in a numerical case.

Let $C = \int e^{-3x}\cos 2x\,dx$ and $S = \int e^{-3x}\sin 2x\,dx$.

Then
$$C + iS = \int e^{-3x}(\cos 2x + i\sin 2x)\,dx = \int e^{-3x}e^{2ix}\,dx$$
$$= \int e^{(-3+2i)x}\,dx = \frac{e^{(-3+2i)x}}{-3 + 2i} = \frac{-3 - 2i}{13}\,e^{-3x}e^{2ix}$$
$$= \frac{-3 - 2i}{13}\,e^{-3x}(\cos 2x + i\sin 2x).$$

Separating the real and imaginary parts, we obtain C and S:

$$C = \frac{e^{-3x}}{13}(-3\cos 2x + 2\sin 2x) \quad\text{and}\quad S = \frac{e^{-3x}}{13}(-2\cos 2x - 3\sin 2x)$$


(arbitrary constants have been omitted). These perfectly real results have 
been achieved by the use, not only of complex numbers, but of exponential 
functions of complex numbers. Their truth should be verified by direct 
differentiation. 
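Besides differentiating by hand, the results for C and S may be checked numerically. The Python sketch below estimates the derivatives by central differences and compares them with the integrands; the helper `derivative` and the step length are our own illustrative choices.

```python
import math


def C(x: float) -> float:
    """Integral of e^(-3x) cos 2x, arbitrary constant omitted."""
    return math.exp(-3 * x) * (-3 * math.cos(2 * x) + 2 * math.sin(2 * x)) / 13


def S(x: float) -> float:
    """Integral of e^(-3x) sin 2x, arbitrary constant omitted."""
    return math.exp(-3 * x) * (-2 * math.cos(2 * x) - 3 * math.sin(2 * x)) / 13


def derivative(f, x: float, h: float = 1e-5) -> float:
    """Central-difference estimate of f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)


for x in (0.0, 0.5, 1.2):
    print(
        math.isclose(derivative(C, x), math.exp(-3 * x) * math.cos(2 * x), abs_tol=1e-6),
        math.isclose(derivative(S, x), math.exp(-3 * x) * math.sin(2 * x), abs_tol=1e-6),
    )
```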


Exponential functions of the general complex number
If $z = x + iy$, then $e^z = e^{x+iy} = e^x e^{iy} = e^x\operatorname{cis} y$
$\Rightarrow\ |e^z| = e^x$ and $\arg e^z = y$.
Moreover, we may now say that $\ln(e^x\operatorname{cis} y) = x + iy$,
or $\ln(r\operatorname{cis}\theta) = \ln r + i\theta\ (+\,2n\pi i)$.
For example,

$$\ln(-3) = \ln(3\operatorname{cis}\pi) = \ln 3 + i\pi.$$

Note that the logarithm of a (complex) number has an infinity of values
corresponding to the infinity of arguments, e.g., $\ln(-3) = \ln 3 + 3\pi i$, or
$\ln 3 - 5\pi i$, etc.

As a further application, suppose we want to find $i^i$. We may write
$i = \operatorname{cis}\tfrac12\pi = e^{\frac12\pi i}$.

So $i^i = (e^{\frac12\pi i})^i = e^{-\frac12\pi} \approx 0.207\ldots$, the value, surprisingly, being real.
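These values are readily confirmed with Python's standard `cmath` module, which returns the principal value of the logarithm. The helper `log_values` below is our own illustrative name; it lists some of the other values $\ln r + i\theta + 2n\pi i$.

```python
import cmath
import math


def log_values(z: complex, n_values) -> list[complex]:
    """Principal logarithm of z shifted by 2*n*pi*i for each n supplied."""
    principal = cmath.log(z)
    return [principal + 2j * math.pi * n for n in n_values]


print(cmath.log(-3))              # ln 3 + i*pi, the principal value
print(log_values(-3, (1, -3)))    # ln 3 + 3*pi*i and ln 3 - 5*pi*i
print((1j) ** (1j))               # a real number, about 0.2079
print(math.exp(-math.pi / 2))     # e^(-pi/2), for comparison
```

Every value produced by `log_values` returns the original number when exponentiated, which is what is meant by saying that the logarithm is many-valued.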


Since $\cos x + i\sin x = e^{ix}$,

and $\cos x - i\sin x = e^{-ix}$,

we deduce, by adding and subtracting, that

$$\cos x = \frac{e^{ix} + e^{-ix}}{2}, \qquad \sin x = \frac{e^{ix} - e^{-ix}}{2i},$$

and discover the astonishing revelation that trigonometric functions, which
started life as ratios of sides of a right-angled triangle, are really exponential
functions in disguise. Moreover, these exponential versions of the trigonometric
functions bear a striking resemblance to the definitions of the
hyperbolic functions (see 9.9):

$$\cosh x = \frac{e^x + e^{-x}}{2}, \qquad \sinh x = \frac{e^x - e^{-x}}{2}.$$

Hence we may now state the following relations connecting the circular
and hyperbolic functions, which the reader may verify by direct substitution:

$$\cosh ix = \cos x, \qquad \sinh ix = i\sin x;$$

$$\cos ix = \cosh x, \qquad \sin ix = i\sinh x.$$

We have already, in 9.9, noted the close resemblance between trigonometric
and hyperbolic formulae, e.g., for $\cos(a + b)$ and $\cosh(a + b)$,
and we are now in a position to see how this arises.

For
$$\cosh(a + b) = \cos i(a + b)$$
$$= \cos ia\cos ib - \sin ia\sin ib$$
$$= \cosh a\cosh b - (i\sinh a)(i\sinh b)$$

$$\Rightarrow\quad \cosh(a + b) = \cosh a\cosh b + \sinh a\sinh b,$$

compared with

$$\cos(a + b) = \cos a\cos b - \sin a\sin b.$$

More generally, we can summarise these resemblances in Osborn's rule,
that the formulae correspond exactly provided that the sign is changed
whenever there is a product of two sine or two sinh functions. Thus, again,
the formulae

$$\cos 2x = 1 - 2\sin^2 x, \qquad \cos^2 x + \sin^2 x = 1, \qquad \sin 2x = 2\sin x\cos x,$$
$$\cosh 2x = 1 + 2\sinh^2 x, \qquad \cosh^2 x - \sinh^2 x = 1, \qquad \sinh 2x = 2\sinh x\cosh x$$

may all be quoted in support of Osborn's rule.

We may also see that our statements regarding $\sin\theta$ and $\cos\theta$ as exponential
functions are consistent with the various formulae in 'real' trigonometry.
For example, $\sin(\theta - \phi) = \sin\theta\cos\phi - \cos\theta\sin\phi$; and if we
substitute the exponential versions in the R.H.S., we get

$$\frac{e^{i\theta} - e^{-i\theta}}{2i}\cdot\frac{e^{i\phi} + e^{-i\phi}}{2} - \frac{e^{i\theta} + e^{-i\theta}}{2}\cdot\frac{e^{i\phi} - e^{-i\phi}}{2i}$$
$$= \frac{1}{4i}\left(e^{i\theta+i\phi} + e^{i\theta-i\phi} - e^{-i\theta+i\phi} - e^{-i\theta-i\phi} - e^{i\theta+i\phi} + e^{i\theta-i\phi} - e^{-i\theta+i\phi} + e^{-i\theta-i\phi}\right)$$
$$= \frac{1}{4i}\left(2e^{i(\theta-\phi)} - 2e^{-i(\theta-\phi)}\right) = \frac{e^{i(\theta-\phi)} - e^{-i(\theta-\phi)}}{2i},$$

which is the exponential version of $\sin(\theta - \phi)$. A similar verification may
be achieved with any of the trigonometrical formulae.

With the above equipment, we may now even make sense of trigonometrical
functions of complex numbers:
$$\sin z = \sin(x + iy) = \sin x\cos iy + \cos x\sin iy$$
$$= \sin x\cosh y + \cos x\,(i\sinh y)$$
$$= \sin x\cosh y + i\cos x\sinh y,$$
so that $\sin z$ is a complex number, and we have separated it into real and
imaginary parts.
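The separation of $\sin z$ into real and imaginary parts may be compared with a library evaluation. In the Python sketch below, `sin_by_parts` is our own name for the formula $\sin x\cosh y + i\cos x\sinh y$, and `cmath.sin` is the standard-library complex sine.

```python
import cmath
import math


def sin_by_parts(x: float, y: float) -> complex:
    """sin(x + iy) assembled as sin x cosh y + i cos x sinh y."""
    return complex(math.sin(x) * math.cosh(y), math.cos(x) * math.sinh(y))


for x, y in ((0.5, 1.0), (2.0, -0.3), (-1.2, 2.5)):
    print(sin_by_parts(x, y), cmath.sin(complex(x, y)))
```

The two columns of output agree to within rounding error.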


Example 2
Find $\cos^{-1} 2$.
Suppose $\cos^{-1} 2 = x + iy$.

Then $\cos(x + iy) = 2$
$$\Rightarrow\quad \cos x\cosh y - i\sin x\sinh y = 2$$
$$\Rightarrow\quad \cos x\cosh y = 2 \qquad \text{(i)}$$
$$\text{and}\quad \sin x\sinh y = 0. \qquad \text{(ii)}$$

From (ii), $\sin x = 0$ or $\sinh y = 0$.

Case 1

$\sinh y = 0 \Rightarrow y = 0 \Rightarrow \cosh y = 1$.
Substituting in (i), $\cos x = 2$.

But x is real, so this is impossible.

Case 2

$\sin x = 0 \Rightarrow x = n\pi\ (n \in \mathbb{Z}) \Rightarrow \cos x = (-1)^n$.

Substituting in (i), $(-1)^n\cosh y = 2$.

But y is real, so $\cosh y \geq 1 \Rightarrow n$ is even $(= 2k)$, and $\cosh y = 2$.

Hence $x = 2k\pi$ and $y = \cosh^{-1} 2 = \pm\ln(2 + \sqrt3)$,
and finally $\cos^{-1} 2 = 2k\pi \pm i\ln(2 + \sqrt3)$.

Example 3 


For a circuit containing resistance R, inductance L and capacitance C,
in series, the differential equation satisfied by Q, the quantity of electricity
which has passed at time t, is $L\ddot{Q} + R\dot{Q} + Q/C = V_0\cos\omega t$ (where
$V_0\cos\omega t$ is an alternating input E.M.F.). Suppose that for a particular
circuit, the differential equation arising is

$$\ddot{Q} + 6\dot{Q} + 25Q = 60\cos 5t,$$

and that we wish to find the current flowing at time t.
Seeking first the complementary function (see 9.8), we obtain the
quadratic equation $k^2 + 6k + 25 = 0$, so that $k = -3 \pm 4i$.
Thus the c.f. is
$$Q = Ae^{(-3+4i)t} + Be^{(-3-4i)t} = e^{-3t}(Ae^{4it} + Be^{-4it})$$
$$= e^{-3t}[A(\cos 4t + i\sin 4t) + B(\cos 4t - i\sin 4t)]$$
$$= e^{-3t}(C\cos 4t + D\sin 4t),$$

where $C = A + B$ and $D = i(A - B)$ are a fresh pair of arbitrary constants
to replace the pair A, B.

The particular integral will be of the form $Q = L\cos 5t + M\sin 5t$.
Direct substitution back into the differential equation enables us to find
the values of L, M, namely $L = 0$, $M = 2$, so the complete solution of the
differential equation is

$$Q = e^{-3t}(C\cos 4t + D\sin 4t) + 2\sin 5t.$$

Hence the current $I = \dot{Q} = e^{-3t}(E\cos 4t + F\sin 4t) + 10\cos 5t$, where
E, F are yet another replacement pair of arbitrary constants. Their values
could be found from two pieces of information, such as the initial charge
and current. However, since the factor $e^{-3t}$ diminishes rapidly with time,
this term, arising as the complementary function, soon ceases to be of any
consequence, and is known as the 'transient'. The term $10\cos 5t$ gives the
value of the permanent current resulting in the circuit from the application
of the given alternating input.
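The complete solution may be checked by substituting it back into the differential equation. The Python sketch below does so by writing the transient as the real part of $Ke^{(-3+4i)t}$ with $K = C - iD$, so that its derivatives are obtained simply by multiplying by $-3 + 4i$; the function names and the sample constants are our own illustrative choices.

```python
import cmath
import math

ROOT = complex(-3, 4)  # one root of k^2 + 6k + 25 = 0


def charge_and_derivatives(t: float, C: float, D: float):
    """Return Q, dQ/dt, d2Q/dt2 for Q = e^(-3t)(C cos 4t + D sin 4t) + 2 sin 5t."""
    K = complex(C, -D)              # Re(K e^(ROOT t)) = e^(-3t)(C cos 4t + D sin 4t)
    e = K * cmath.exp(ROOT * t)
    q = e.real + 2 * math.sin(5 * t)
    dq = (ROOT * e).real + 10 * math.cos(5 * t)
    d2q = (ROOT ** 2 * e).real - 50 * math.sin(5 * t)
    return q, dq, d2q


def residual(t: float, C: float, D: float) -> float:
    """Left-hand side minus right-hand side of Q'' + 6Q' + 25Q = 60 cos 5t."""
    q, dq, d2q = charge_and_derivatives(t, C, D)
    return d2q + 6 * dq + 25 * q - 60 * math.cos(5 * t)


for t in (0.0, 0.2, 1.0, 3.0):
    print(t, residual(t, C=1.5, D=-0.7))  # each residual is zero up to rounding
```

The second returned value is the current; for large t it is indistinguishable from $10\cos 5t$, the transient having died away.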
Exercise 10.5


1 Express in the form $x + iy$:

(2) οεἶἶπ. (ii) εἶπ, (i) e 1*2*; (av) εἶ — e's (v) sinh (2 — 51); 
ies cos (3 + 1); (viz) In (-- 8); (viz) In 32; (ax) In (./3 + 2); 
(x) | (—,/3 — 1] 


2 Prove that $e^{x-iy} = (\cosh x + \sinh x)(\cos y - i\sin y)$.


3 Using the exponential definitions of sin and cos, prove the formula
$\sin(\theta + \phi) = \sin\theta\cos\phi + \cos\theta\sin\phi$ and other trigonometrical
formulae.


4 Use $\operatorname{cis} 3x = e^{3ix}$ to integrate $e^{-2x}\cos 3x$ and $e^{-2x}\sin 3x$.


5 If $\sin(x + iy) = r\operatorname{cis}\theta$, prove that $r^2 = \tfrac12(\cosh 2y - \cos 2x)$ and
$\tan\theta = \cot x\tanh y$.

6 Find the real and imaginary parts and the moduli of:

(i) $\sin z$; (ii) $\cos z$; (iii) $\tan z$, where $z = x + iy$;

(iv) Express the real part of $\tan z$ as $\dfrac{\sin 2x}{\cos 2x + \cosh 2y}$.

7 Find z so that $\sin z = -3$.

8 Ifsinh(x + iy) = e*'", find χ and y. 

9 Express $u = e^x\cos y$, $v = e^x\sin y$ as a single equation in $z\ (= x + iy)$
and $w\ (= u + iv)$. Repeat for $u = \cosh x\cos y$, $v = \sinh x\sin y$.

10 Given that $\tanh z = w$, express $\tanh 2x$ and $\tan 2y$ in terms of u and v,
where $z = x + iy$ and $w = u + iv$.


11 Functions $c(x)$ and $s(x)$ are defined by the formulae
$$c(x) = \tfrac12(e^{ix} + e^{-ix}) \quad\text{and}\quad s(x) = \frac{1}{2i}(e^{ix} - e^{-ix});$$
prove that $[c(x)]^2 + [s(x)]^2 = 1$ and that $[c(x)]^2 - [s(x)]^2 = c(2x)$.


12 Give the general solution of the differential equations: 


. dy dy ea dy 

-Ξ -- 8— 5 Deere Soa = 0; 
(1) 72 oe + 25y = (11) 12 2 τα + 2y : 
ον ἘΠ eds 
(222 ) a2 = 6 — 4x; (wv) τ + 9x = sin 41. 


13 Obtain the solutions of the following differential equations subject
to the stated boundary conditions (dots indicate differentiation with
respect to time):

(i) $\ddot{x} - 6\dot{x} + 10x = 0$, given that $x = 0$ and $\dot{x} = 3$ at $t = 0$;

(ii) $\ddot{y} + 9y = 4$, where $y = 0$, $\dot{y} = 2$ at $t = 0$.
*10.6 Transformations in the Argand diagram


Suppose that z is a typical point of a configuration in the Argand diagram,
and let $z' = z + a$, where a is a fixed complex number.

[Figure: Translation, $z' = z + a$]

Every point of the configuration undergoes a similar displacement specified
by the (constant) vector OA. The transformation $z' = z + a$ therefore
represents a translation. For example, $z' = z + 2 - 3i$ represents a translation
which carries the whole figure bodily 2 units in the positive x-direction
and 3 units in the negative y-direction.

Consider next the transformation represented by $z' = pz$, where p is
real and positive.

Its effect is to multiply the modulus of z by a factor p, whilst its argument is
unchanged. For example, if $p = 3$, the point $2\operatorname{cis} 50^\circ$ would map into the
point $6\operatorname{cis} 50^\circ$, while the point $2 - 5i$ would map into $6 - 15i$. Thus the
transformation is an enlargement of the original configuration from the
origin in the ratio $p:1$, the image of the configuration being similar to the
original. The transformation is known as an enlargement (or magnification,
similarity, dilatation, homothety) even when $p < 1$; and when $p = 1$, we
obtain the identity transformation.
We have already seen in 10.2 that the transformation $z' = z\operatorname{cis}\phi$ has
the effect of a pure rotation about the origin, since the modulus is unchanged,
while the argument is increased by the angle $\phi$.

The final image will be congruent with the original. Note that when
$\phi = \tfrac12\pi$, we obtain $z' = iz$, and we have already remarked that multiplication
by $i$ may be interpreted geometrically as a positive quarter-turn. For
rotation about a point other than the origin, say about the point A, remembering
that vector AZ is represented by $z - a$, and AZ' by $z' - a$, we have

$$z' - a = (z - a)\operatorname{cis}\phi.$$

[Figure: Rotation, $z' - a = (z - a)\operatorname{cis}\phi$]

For example, if $z' = iz + 4$, we may write $z' - a = i(z - a)$,
where $a = ia + 4$, so that $a = \dfrac{4}{1 - i} = 2(1 + i)$.
Thus $z' - 2 - 2i = i(z - 2 - 2i)$,
which represents a rotation through $90^\circ$ about the point $2 + 2i$.
All the above transformations have been 'direct' in the sense that the
clockwise or anti-clockwise sense of description of a figure is preserved in
each case. A mirror reflection, however, is an 'opposite' transformation,
the figure now being 'turned over'. The simplest case is reflection in the
real axis, given by the equation $z' = z^*$. Reflection in the imaginary axis
would be represented by $z' = -z^*$, and reflections in other lines will be
given by equations of the form $z' = az^* + b$ $(a, b \in \mathbb{C})$. However, the
latter equation does not always give a pure reflection; for in the simple case
$z' = z^* + a$, where a is real, there is a reflection in the real axis combined
with a translation parallel to it. Such a transformation is known as a 'glide
reflection'.

[Figure: Glide reflection in the real axis, $z' = z^* + a$]

Again, $z' = pz^*$, where p is real $(\neq 1)$, includes a similarity as well as a
reflection.

[Figure: $z' = pz^*$]
Composition of transformations


Argand diagram methods provide a simple means of finding the result of
combining successive transformations in the plane. For example, a rotation
$\tfrac12\pi$ about O followed by a translation $+4$ is represented by $z' = iz + 4$,
and we have already seen that this is equivalent to a single rotation about
the point $2 + 2i$.

It should be noted that the two transformations do not commute. For if the
translation were followed by the rotation, we would get $z' = i(z + 4)$,
which is a rotation about $-2 + 2i$.
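The failure of these two transformations to commute can be seen by carrying them out on a machine. The following Python sketch is only an illustration: the function names are ours, and `centre_of` finds the invariant point of $z' = az + b$ as $b/(1 - a)$, which requires $a \neq 1$.

```python
import cmath
import math


def rotate(z: complex, phi: float, centre: complex = 0j) -> complex:
    """Rotate z through the angle phi about the given centre."""
    return centre + (z - centre) * cmath.exp(1j * phi)


def translate(z: complex, a: complex) -> complex:
    """Translate z by the fixed complex number a."""
    return z + a


def centre_of(a: complex, b: complex) -> complex:
    """Invariant point of z' = a*z + b (assumes a != 1)."""
    return b / (1 - a)


def rotate_then_translate(z: complex) -> complex:
    return translate(rotate(z, math.pi / 2), 4)      # z' = iz + 4


def translate_then_rotate(z: complex) -> complex:
    return rotate(translate(z, 4), math.pi / 2)      # z' = i(z + 4)


print(centre_of(1j, 4))    # 2 + 2i
print(centre_of(1j, 4j))   # -2 + 2i
z = 1 + 3j
print(rotate_then_translate(z), rotate(z, math.pi / 2, centre=2 + 2j))
```

The last line shows the composite agreeing with a single quarter-turn about $2 + 2i$.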
On the other hand, an enlargement $z' = pz$ in the origin, together with
a rotation $z' = z\operatorname{cis}\phi$ (also about the origin), do commute, giving the composite
transformation $z' = p\operatorname{cis}\phi \times z$; i.e., $z' = az$, where a is complex.
This type of transformation we have already met, as a spiral similarity.

Next consider the enlargement $z' = 3z$ in the origin, together with the
half-turn $z' - i = -(z - i)$ about $i$. If the enlargement is done first, we
get
$$z' - i = -(3z - i) \quad\Rightarrow\quad z' = -3z + 2i.$$

But if they are commuted,

$$z' = 3[i - (z - i)] \quad\Rightarrow\quad z' = -3z + 6i.$$

In the first case, we may write $z' = -3z + 2i$ in the form
$z' - a = -3(z - a)$, where $4a = 2i$,
giving $a = \tfrac12 i$; i.e., $z' - \tfrac12 i = -3(z - \tfrac12 i)$, which is a spiral similarity
about the point $\tfrac12 i$. You should find the centre of the spiral similarity in the
second case, when the half-turn is done first.

The above discussion opens up the whole study of plane transformations, 
and the interested reader may find a more thorough treatment of ele- 
mentary transformation geometry of the plane in an article by F. J. Budden 
in the Mathematical Gazette, Feb. 1969, pp. 132—158. Here we must content 
ourselves with leaving further work to the exercises, though we shall 
continue the development of the subject when dealing with functions of a 
complex variable in 10.8, where we shall be concerned with some simple 
cases of non-linear functions. 


Exercise 10.6 


1 Interpret the following transformations, and find the invariant points 
where possible: 

1) zo=z+2 — 41; Ζ᾽ Ξ: 2 - 4: -- Ζ: 
71) Z = 912: 

z’ = (5 cis 100°)z; 
z’ = z(1 —2); 


2 Write down an equation of the form z’ = f(z) to represent the follow- 
ing: | 

(1) A translation +3 in the direction of the imaginary axis. 

(1) A spiral similarity with ratio 3:1 and angle of rotation +135° 
about O. 

(111) A spiral similarity with ratio 3:1 and angle of rotation +135° 
about 7. 

(wv) A rotation through -- ἐπ about the point 1 — 2. 

(v) A half-turn about $4 + i$.

(vi) An enlargement 1:4 in the point $2 + i$.

(vii) A magnification $\times 3$ in the origin followed by a translation $-2$ in
the direction of the imaginary axis.

(viii) The same as in (vii) but in the reverse order.

(ix) An enlargement in the ratio 2:1 in the point 1 followed by a rotation 
through — ἀπ about the point 2. 

(x) The same as in (ix) but in the reverse order. 


3 Analyse the transformations:
(1) Ζ’ Ξξ 22 - ΕἸ --α͵σ;ςο Cie Sz" Sela, 


4 By using complex numbers show that successive half-turns about
a and b are equivalent to a translation $2\overrightarrow{AB}$.


5 Interpret the following transformations: 
(1) 2 -Ξῷ ζῆ: (11) z' = 12%; (a) z’ = 2* cis 2a; 
(w) z’ = 1z* — 2; (i), er ( 2 2). 


*10.7 Loci in the Argand diagram 


As a complex number is an ordered pair of reals, it may be used to repre- 
sent any two-dimensional vector, such as a displacement, or a force, or an 
impedance operator in the theory of alternating currents. Such vectors 
are subject to change: a particle moves in a plane so that the complex 
number representing its position traces out a curve, the resistance or the 
frequency of a circuit is altered with the result that its impedance is altered, 
and the corresponding points on the Argand diagram move. We shall 
therefore be concerned with the locus of a point z in the Argand diagram, 
moving subject to certain conditions. The equations of such curves may in 
certain cases be expressible algebraically in terms of z. For example, a 
circle centre O and radius p may be written

$$|z| = p \quad\text{or}\quad zz^* = p^2,$$

while the half-line through O making angle $\theta$ with the real axis has equation
$\arg z = \theta$. Similarly, the equation $\arg(z + 2i) = 300^\circ$ represents the
half-line terminating at the point $-2i$ and proceeding in the direction
$300^\circ$.
The equation of any circle, centre A and radius p, may be written

$$|z - a| = p \quad\Rightarrow\quad (z - a)(z - a)^* = p^2$$

$$\Rightarrow\quad zz^* - za^* - z^*a + (aa^* - p^2) = 0.$$

This can be rewritten in the usual Cartesian form,

$$x^2 + y^2 + 2gx + 2fy + c = 0,$$

by replacing z and $z^*$ by $x + iy$ and $x - iy$.

Circle equations may alternatively be written in the form

$$\left|\frac{z - a}{z - b}\right| = \lambda \quad (\lambda \in \mathbb{R}^+),$$

such an equation expressing the geometrical property that the point Z
moves so that the ratio of its distances from two fixed points A and B is $\lambda$,
the locus under such conditions being called the 'circle of Apollonius'.
Likewise, the equation

$$\arg\left(\frac{z - a}{z - b}\right) = \phi$$

represents a circular arc terminated by the points A, B; for $\arg\left(\dfrac{z - a}{z - b}\right)$
is the angle of turn from BZ to AZ, and so the equation expresses the
property that this angle is constant. In our figure we show the cases where
$\phi = 50^\circ$, $130^\circ$, $-130^\circ$, and $-50^\circ$.

[Figure: circular arcs through A and B for $\phi = 50^\circ$, $130^\circ$, $-130^\circ$ and $-50^\circ$]


We may invent other examples of loci based upon our geometrical
knowledge; e.g., the locus of Z so that AZ + BZ = constant is an ellipse,
and this may be expressed in complex numbers as $|z - a| + |z - b| = \lambda$
$(\lambda \in \mathbb{R}^+)$. Again, the inequality $|z| < 1$ represents the whole of the interior
region of the unit circle; while $1 < |z + i| \leq 3$ represents the region
lying between the circles centred at the point $-i$ with radii 1 and 3, including
the circumference of the latter circle.


Exercise 10.7 


1 Describe the loci when z moves in the Argand diagram subject to the 
following conditions: 


(1) ΙΖ τε“. (11) ΙΖ - Ι; = ΙΖ + 1]; 

(ii) |z -- a] = [Σ + 1]; (wv) 2 = 2%; 

(Ὁ) arg(z — i) = —3m5 (vi) |e] t+ ie-1 +a] = 2; 
Ὲ ΣΙ ἘΞΟῚ nt 

(vit ) args ἐπ; (viii) |z| = 3|2 -- 1); 


(ix) arg(z — 1) —arg(z +7) =m; (x) ΙΖ -- Ι} + Iz + l| = 4͵ 


2 Shade the regions of the Argand diagram for which z is subject to the 
following restraints: 


(a) 1253; (i) |z + 2| 2; (i) |z + 41} > 3; 
(wv) Iz -—3+%| > 2; (v) ΙΖ -- {|| [1 1} <4; (v2 ) 5." ay: 


3 From the equation $|z + 16| = 4|z + 1|$, deduce that $|z| = 4$, and
interpret geometrically.

4 Show that $|z + 2i| = |2iz - 1|$ represents a circle, and find its centre
and radius.

5 Express $|z/(z - 1)| = 4$ using z and $z^*$ instead of modulus signs. Repeat
for $|(z - a)/(z - b)| = \lambda$.


6 If z moves along the semicircle connecting the points $i$, 1 and $-i$,
describe the loci of the following:

(iy 9 Ζ: (1) —2z; (211) 12; (Ὁ) 242-- 12; 

yet: TZ (vit) z7 — 1; (vii) χ΄ 2. 

7 If z moves along the real axis from $-1$ to $+1$, find the locus of
$(1 - iz)/(z - i)$.

8 If $(z - i)/(z - 1)$ is purely imaginary, show that the locus of z is a
circle centre $\tfrac12(1 + i)$, radius $\tfrac12\sqrt2$.


9 If z describes the semicircle from the point 1 through the point $i$ to the
point $-1$, and thence the part of the real axis from $-1$ to $+1$, describe
the locus of:

GO) 24 i Goze (ΝΠ ἶν yee yz. 

10 If z describes the square 0, 1, $1 + i$, $i$, 0, describe the locus of:

Cae ae (11) ¢7; (111) cosh z. 


*10.8 Functions of a complex variable 
When x is a real variable, the function f(x) is usually illustrated by a 
graph. Alternatively, however, it can be represented as a mapping from a 


set of values of x (its domain) onto a set of values of f(x) (its range) by an 
arrow diagram. 


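[Figure: arrow diagram of the mapping $x \mapsto f(x)$, with $x_1 \mapsto f(x_1)$ and $x_2 \mapsto f(x_2)$]

For instance,
$$f: x \mapsto 2x + 1$$
can be represented by a graph or by an arrow diagram.

[Figures: graph and arrow diagram for $f: x \mapsto 2x + 1$]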


If z is a complex variable, the function f(z) can no longer be shown in 
the first of these ways, as a graph, and has to be shown as a mapping, this 
time from a two-dimensional set of points (its domain) in one Argand 
diagram onto a set of points in another Argand diagram (its range). 


[Figure: the mapping f from the z-plane to the w-plane]

So, just as we can speak of $y = f(x)$ as a mapping $f: x \mapsto y$ of points from
an x-line into a y-line, so we can also speak of $w = f(z)$ as a mapping
$f: z \mapsto w$ of points from a z-plane into a w-plane.

For example, the function $w = 2z + 3 - i$ maps each point z to an image w;
thus O $(z = 0)$ maps to O' $(w = 3 - i)$ and U $(z = 1)$ maps to U' $(w = 5 - i)$.

[Table: the points O, U, I, J, V, W, A, B, K of the z-plane and their images O', U', I', J', V', W', A', B', K' in the w-plane]

But it is rather simpler in this case to think of the z-plane and w-plane as
being superimposed, and to regard $w = 2z + 3 - i$ as carrying the
point z to a new position, first by the enlargement in the origin $z' = 2z$,
followed by the translation $w = z' + 3 - i$. If z belongs to a configuration,
then the whole of the configuration is carried by the transformation to the
new position.


Such mappings are extremely important when the theory of complex 
numbers is applied to physical problems, ranging from the investigation 
of streamlines of fluids to that of lines of electric force and equipotentials, 
and to the study of the behaviour of oscillating circuits. Within mathe- 
matics itself, functions of a complex variable are equally important: 
exploration of what meaning can be given to such ideas as continuity, 
differentiation and integration has led to one of its most fertile areas. But 
this is a journey which cannot be undertaken here, and we confine ourselves 
to investigating a few examples. 
The general linear function
It is clear that the general linear transformation

$$w = az + b \quad (a, b \in \mathbb{C})$$
or
$$w = (p\operatorname{cis}\phi)z + b \quad (\text{where } a = p\operatorname{cis}\phi)$$

may be analysed into basic elementary steps by splitting the formula as
follows:

$z_1 = pz$, an enlargement in O;
$z_2 = z_1\operatorname{cis}\phi = az$, a rotation about O (these two together forming a spiral similarity);
$w = z_2 + b = az + b$, a translation.

Thus the general linear transformation may be composed of enlargements,
rotations, and translations, and so must preserve straight lines. When
$p = 1$, it will preserve distance (and so be an isometric transformation);
when $p \neq 1$ the image will be similar to the original figure. The example
$w = 2z + 3 - i$ was such a transformation, though the element of rotation
was absent because the value of a $(= 2)$ was real (so that $\phi = 0$).


The reciprocal function, w = 1/z 


The function 1/z is immediately seen to be quite different in that it does 
not preserve straight lines. We may see this easily by considering the three 
collinear points 


    U(1 + 0i),   S(1 + i),   T(1 − i),
which have as images:
    U'(1 + 0i),   S' ½(1 − i),   T' ½(1 + i);


and these do not lie on a straight line. 


The transformation represented by w = 1/z may be investigated
geometrically, but this requires a knowledge of inversion and we content
ourselves with a simple case, using analytical methods.

Example 1

If w = 1/z, where w = u + iv and z = x + iy,

then    u + iv = 1/(x + iy) = (x − iy)/(x² + y²),

so that    u = x/(x² + y²)   and   v = −y/(x² + y²).

Consider a line u = constant = λ in the w-plane.
Since u = x/(x² + y²) = λ, we have

    x² + y² − (1/λ)x = 0,   and this is a circle in the z-plane.

Similarly the line y = μ in the z-plane will lead to the circle locus whose
equation is

    u² + v² + (1/μ)v = 0   in the w-plane.

Example 2
If z traces out the circle |z| = 1 anti-clockwise from U back to U, find
the locus of w = (z − 1)/(z + 1).

This could be done by writing w = 1 − 2/(z + 1) and breaking this
down into

    z₁ = z + 1,                                     a translation;
    z₂ = 1/z₁ = 1/(z + 1),                          the reciprocal transformation;
    z₃ = −2z₂ = −2/(z + 1),                         a half-turn and enlargement;
    w = z₃ + 1 = 1 − 2/(z + 1) = (z − 1)/(z + 1),   a translation;

and then finding the geometrical effect of each of these on the unit circle.
Or we may take z = cis θ, so that

    w = (cis θ − 1)/(cis θ + 1)
      = (cis ½θ − cis(−½θ))/(cis ½θ + cis(−½θ))
      = (2i sin ½θ)/(2 cos ½θ)
      = i tan ½θ.

Thus, as θ goes from 0 to π, w traces out the imaginary axis from O to
infinity; whilst as θ goes from π to 2π, w traces the other half of the
imaginary axis from −∞ up to the origin.

There is, however, a simple geometric solution. For if U, V are the
points 1, −1 (our usual notation), then z − 1 and z + 1 are represented
by the vectors UZ, VZ. The fact that these are perpendicular (∠UZV being
the angle in a semi-circle), means that their quotient w must be 'purely
imaginary', or a real multiple of i. Hence w lies on the imaginary axis.

Alternatively we may rearrange the formula as z = (1 + w)/(1 − w). Since
|z| = 1, it follows that |1 + w| = |1 − w|, so that w is equidistant from the
points −1 and +1, and therefore lies on the imaginary axis.
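
The result w = i tan ½θ can be spot-checked numerically. The following Python sketch is an illustration only (the name w_of and the sample angles are our own choices); it evaluates (z − 1)/(z + 1) at a few points z = cis θ of the unit circle, avoiding θ = π where z = −1.

```python
import cmath
import math

def w_of(z):
    """The transformation w = (z - 1)/(z + 1)."""
    return (z - 1) / (z + 1)

# Sample angles on the unit circle (theta = pi is excluded: there z = -1).
thetas = [0.3, 1.0, 2.5, 4.0, 5.9]
images = [w_of(cmath.exp(1j * t)) for t in thetas]

for t, w in zip(thetas, images):
    # The real part should vanish and the imaginary part equal tan(theta/2).
    print(f"theta={t:.1f}  w={w:.6f}  tan(theta/2)={math.tan(t / 2):.6f}")
```

For angles below π the imaginary part is positive, and for angles above π it is negative, in agreement with the two halves of the imaginary axis described above.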


Example 3 


If w = z², find the locus of z when w traces the lines u = constant,
v = constant.

    u + iv = (x + iy)² = x² − y² + 2ixy;

    ⇒   u = x² − y²,   v = 2xy.

Hence u = constant gives a family of rectangular hyperbolas,

    x² − y² = λ   (λ ∈ R),

and v = constant gives another family of rectangular hyperbolas,

    xy = μ   (μ ∈ R).

All mutual intersections are perpendicular and the families are said to be
orthogonal.


Example 4
Express u = e⁻ˣ sin y, v = e⁻ˣ cos y as a relation between w and z.

    w = u + iv
      = e⁻ˣ(sin y + i cos y)
      = ie⁻ˣ(cos y − i sin y)
      = ie⁻ˣ e^(−iy)
      = ie⁻ᶻ.

Polynomial functions: the fundamental theorem of algebra


Earlier in this chapter the reader may have had the uneasy feeling that 
there may just possibly exist a certain polynomial function of the form 


    f(z) = a₀zⁿ + a₁zⁿ⁻¹ + ··· + aₙ₋₁z + aₙ

with the property that there is no value of z whatsoever which makes
f(z) = 0. Certainly there are quadratic equations which have no solutions
in the field of real numbers, and it might reasonably have been supposed
by analogy that there are also polynomial equations which have no solutions
even in the much wider field of complex numbers. The reader should
not be ashamed of harbouring such doubts, for the same question troubled
many mathematicians for ages. It was finally resolved, and doubts dispelled,
by the so-called fundamental theorem of algebra, which states that for
every polynomial f(z) there does indeed exist a value of z (∈ C), which
satisfies the equation f(z) = 0. In other words, every polynomial equation
has at least one root in the field of complex numbers, and this is still true
even when the coefficients a₀, a₁, ..., aₙ are themselves complex. So the
theorem, which is an example of what mathematicians call an existence
theorem, is really a very powerful result; and, once proved, it is a simple
matter to prove further that a polynomial of degree n has n roots in complex
algebra (repeated roots being counted according to their multiplicity). But
the proof of the fundamental theorem of algebra is difficult, involving
sophisticated analytical concepts. For readers who wish to pursue the
matter, a traditional proof may be found in Hardy, G. H., A course in pure
mathematics (Appendix II) or in Dickson, L. E., First course in the theory of
equations (Appendix). But more modern methods of algebraic topology
enable the proof to be considerably shortened.


Exercise 10.8 


1 If w = (z − 1)/(z + 1) (as on p. 159), show that when w traces out the
lines u = constant and v = constant, the locus of z is two systems of
circles. Find also the locus of w as z traces the upper half of the circle
diameter −1, +1.

2 If w = (z − i)/(z + i) and z lies below the real axis, show that w lies
outside the unit circle |w| = 1. How will w move as z travels along the real
axis from −∞ to +∞?

3 If w = (z − 6i)/(z + 8), show that the locus of z is a circle when w
traces the imaginary axis, but a line when w traces the real axis. Find the
equations of the z-loci.

4 Examine the effect of the transformation w = (z + 2)/(z + 7) on the
system of concentric circles having centres at the origin in the z-plane.
What is the image of the exterior of the circle |z| = 1?

5 If w = (z − a)/(z − b) verify that w is obtained by making triangle
0, 1, w similar to triangle z, b, a.

6 If w = (z + 1)² and z traces the unit circle, show that the locus of w
has polar equation r = 2(1 + cos θ) (a cardioid).

7 Show that the lines x = constant and y = constant in the z-plane are
transformed by w = cosh z into sets of confocal ellipses and hyperbolas in
the w-plane.

8 What is the effect of w = eᶻ on the system of lines x = constant,
y = constant in the z-plane?

9 If w = 4/(z + 1)² and |z| = 1, show that w describes the parabola
whose polar equation is r = sec² ½θ. Show that the interior of the circle
maps into the exterior of the parabola.

Ze ἢ 


10 Find the invariant points of the transformation w = ------------ 
2+ 1(2Ζ -- 1} 


11 If (1 − z)(1 − w) = 1 and arg z = {π, find the locus of w.


12 What is the effect of w = cos z on the lines x = constant, y = constant
in the z-plane?


Miscellaneous problems 10


1 Solve the equation z⁴ + 10z² + 169 = 0 by first solving for z², and
then for z in the form a + ib where a and b are real. Hence or otherwise,
express z⁴ + 10z² + 169 as the product of two real quadratic factors.


2 Show that for real x,

    1/(1 + x²) = (1/2i) [1/(x − i) − 1/(x + i)].

Deduce that

    dⁿ⁻¹/dxⁿ⁻¹ [1/(1 + x²)] = ((−1)ⁿ⁻¹ (n − 1)!/2i) [1/(x − i)ⁿ − 1/(x + i)ⁿ].

Prove that, if x − i = r(cos θ − i sin θ),

then    (x − i)⁻ⁿ = r⁻ⁿ(cos nθ + i sin nθ),

and obtain a similar expression for (x + i)⁻ⁿ.

Hence obtain a formula for

    dⁿ⁻¹/dxⁿ⁻¹ [1/(1 + x²)]   in terms of r and θ,

and deduce that for 0 < x < ½π,

    dⁿ/dxⁿ (tan⁻¹ x) = (−1)ⁿ⁻¹ (n − 1)! sinⁿ(cot⁻¹ x) sin(n cot⁻¹ x).   (M.E.I.)
3 What geometrical transformation is given by
z' − c = (z − c)(cos θ + i sin θ), where c is a complex constant?

If the transformation z ↦ z' corresponds to a turn of 120° about the
point (1, 0), and the transformation z' ↦ z'' corresponds to a turn of 60°
about the point (−3, 0), show that z'' = −z + 2i√3, and hence express
in geometrical terms the single transformation equivalent to the two
transformations in the order given. (S.M.P.)


4 (i) Given that one root of the equation z⁴ − 4z³ + 12z² + 4z − 13 = 0
is 2 − 3i, find the other three roots.
(ii) Given z = 2(cos π/6 + i sin π/6) illustrate on the Argand diagram the
points representing the complex numbers z, z*, (z*)², (z*)²/|z|.

S is the set of points in the first quadrant which lie on the circumference
of the circle |z| = 2. Find the image of S under the mapping z ↦ (z*)²/|z|.

(M.E.I.)


5 The vertices of a triangle in the Argand diagram are given by the com- 
plex numbers a, 6, c. Say whether the following are true or false, with 
reasons: 

(7) The centroid is at the point $(a + ὁ + ὁ). 

(1) The area of the triangle is $[(b — c)* + (ὁ — a)? + (a — δ)51. 
(iii) The triangle is equilateral if and only if

    (a − b)(a − c) + (b − c)² = 0.   (C.S.)
6 Let f(z) = (z + z⁻¹) where z is a non-zero complex number.

(i) If z describes a circle |z| = r where r is a positive real constant,
show that if r ≠ 1, f(z) describes an ellipse in the complex number plane.
What happens if r = 1?

(ii) If z describes the half-line arg z = θ, where θ is constant, prove
that, in general (i.e., if certain exceptional values of θ are excluded)
f(z) describes one branch of a hyperbola.

(iii) What are the exceptional cases in (ii)? Describe what happens for
these exceptional cases. (O.S.)


7 If |z − 1| = 1, prove that arg(z − 1) = 2 arg z = ⅔ arg(z² − z),
and give a geometrical interpretation of these results. (O.S.)


8 Show that the roots of (z + i)⁵ = (z − i)⁵ are cot kπ/5 (k = 1, 2, 3, 4),
and hence evaluate cot² π/5 + cot² 2π/5, and cot π/5 cot 2π/5. Generalise
the result and thence obtain

    Σ (r = 1 to n) cot² [rπ/(2n + 1)] = ⅓n(2n − 1).


9 Prove that the roots of

    (z + 1)ⁿ + (z − 1)ⁿ = 0   are   i cot [(2k + 1)π/2n]   (k = 0, 1, 2, ..., n − 1).

Deduce that


= k ] tes Qk ] 
Σ πῇ and Σ τς ΠΟΤΕ δος 
0 2n 0 2n 


10 Show that 


(1 + x)?" + (1 — x) = 3}. + tan? ay 


and prove that 


: (2k — 1)n 

= Dy, 
Σ, sec? re n 
11 Prove that e^(2kπi) = 1 for all integral values of k, and that e^(½πi) = i.
Deduce that a value of iⁱ is e^(−½π), and using the result e^(2kπi) = 1, find a
formula for all possible values of iⁱ. (C.S.)


12 If cos z = w, prove that

    u²/cos² x − v²/sin² x = 1   and   u²/cosh² y + v²/sinh² y = 1

and discuss interpretations in terms of the Argand diagram.


13 For a rotation through α about O, we have
    z' = z cis α   or   x' + iy' = (x + iy)(cos α + i sin α).
Equating real and imaginary parts gives:

    x' = x cos α − y sin α
    y' = x sin α + y cos α,

so we get the rotation matrix

    ( cos α   −sin α )
    ( sin α    cos α ).

Show that reflection in the line y = x tan α can be represented by
    z' = z* cis 2α.

By equating real and imaginary parts, find the matrix for this reflection.


11


Further vectors and mechanics 


11.1 Points of subdivision 


Suppose that two masses are attached to the ends of a light rod AB, 2 kg 
at A and 3 kg at B. 


[Figure: rod AB with P between A and B; masses 2 at A and 3 at B; position vectors a, p, b]


It is found from elementary experiments that such a rod will balance at 
the point P, 


where 2AP = 3PB, 


so we call this the ‘ point of balance’. 
If we now denote the position vectors (from an origin O) of A, B, P by

    a, b, p,

then    2(p − a) = 3(b − p)
    ⇒   5p = 2a + 3b
    ⇒   p = (2a + 3b)/5.

More generally, let us consider two masses, α at A and β at B.

[Figure: rod AB with P between A and B; masses α at A and β at B; position vectors a, p, b]

Then their 'point of balance' would be at P, where

    αAP = βPB.

So      α(p − a) = β(b − p)
    ⇒   (α + β)p = αa + βb
    ⇒   p = (αa + βb)/(α + β).

It is clear that only the ratio α : β is important. For if α and β were both
multiplied by the same number k,

    kαAP = kβPB   ⇒   αAP = βPB

and     p = (kαa + kβb)/(kα + kβ) = (αa + βb)/(α + β),

so that the position of P is unchanged.

When, however, the ratio changes, P will move along the line AB; and
if either of the numbers α, β were allowed to be negative, P would fall
outside the segment AB and be either beyond A or beyond B.
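
The formula p = (αa + βb)/(α + β) is easily tried out numerically. The Python sketch below is an illustration only: the function name balance_point and the sample points A(0, 0), B(5, 0) are our own assumptions, not taken from the text.

```python
def balance_point(alpha, a, beta, b):
    """Return p = (alpha*a + beta*b)/(alpha + beta) for points given as tuples."""
    total = alpha + beta
    if total == 0:
        # The exceptional case alpha = -beta: no point of balance exists.
        raise ValueError("alpha + beta must not be zero")
    return tuple((alpha * ai + beta * bi) / total for ai, bi in zip(a, b))

A, B = (0, 0), (5, 0)

p = balance_point(2, A, 3, B)           # masses 2 at A and 3 at B
p_scaled = balance_point(4, A, 6, B)    # same ratio, both doubled
p_beyond_b = balance_point(-2, A, 3, B) # a negative weight at A
p_beyond_a = balance_point(2, A, -1, B) # a negative weight at B
print(p, p_scaled, p_beyond_b, p_beyond_a)
```

With these sample points, P lies at (3, 0) for weights 2 and 3 (so that 2AP = 3PB = 6), is unchanged when both weights are doubled, and moves outside the segment AB as soon as one weight is negative.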


Example 1
(i) α = −2, β = +3.
So −2AP = 3PB and P is beyond B.

[Figure: line through A, B, P in that order, with weights −2 at A and +3 at B]

Furthermore,   p = (−2a + 3b)/(−2 + 3) = −2a + 3b.

(ii) α = +2, β = −1.
So 2AP = −PB and P is beyond A.

[Figure: line through P, A, B in that order, with weights 2 at A and −1 at B]

Furthermore,   p = (2a − b)/(2 − 1) = 2a − b.

The only exceptional case is when β = −α.
Then αAP = −αPB   ⇒   AP = −PB,
which is impossible when A, B are distinct

(and p = (αa − αb)/(α − α), which is meaningless).

Exercise 11.1a


1 If A, B and P have position vectors a, b and p, find p in terms of a and b
and illustrate by a sketch when:

(i) AP = PB;   (ii) 3AP = PB;   (iii) AP = 3PB;

(iv) 3AP = −PB;   (v) AP = −3PB.


2 If A, B have position vectors a, b, illustrate the points whose position
vectors are:


(ὦ) fa+ 4b; (ii) $a τ ἐδ; (iii) 3a + 36; 


(iv) 3a + 3b;   (v) 2a − b;   (vi) 3a − 2b;
(vii) 4a − 3b;   (viii) 2b − a;   (ix) 3b − 2a;
(x) 4b − 3a.


3 The points A and B have position vectors 6i − 5j and 4i − 3j respectively.
Show that the mid-point M of AB is collinear with the two
points X, Y whose position vectors are 2i − 6j and 11i respectively. (O.C.)


Points of subdivision 


If αAP = βPB,

then    AP/PB = β/α,

so P is said to divide AB in the ratio β : α.

Example 2
(i) α = 2, β = 3.
So 2AP = 3PB
    ⇒   AP/PB = 3/2
and P divides AB in the ratio 3 : 2.

[Figure: A, P, B in that order, with weights 2 at A and 3 at B]

(ii) α = −2, β = 3.
So 2AP = −3PB
    ⇒   AP/PB = −3/2
and P divides AB in the ratio 3 : −2.

[Figure: A, B, P in that order, with weights −2 at A and 3 at B]

(iii) α = 2, β = −1.
So −2AP = PB
    ⇒   AP/PB = −1/2
and P divides AB in the ratio −1 : 2.

[Figure: P, A, B in that order, with weights 2 at A and −1 at B]

Summarising, we see that the following statements are all equivalent:

    P is the centre of mass of α at A and β at B
    αAP = βPB
    p = (αa + βb)/(α + β)
    P divides AB in the ratio β : α


Example 3 


Find the position vectors of the points P, Q which divide AB in the ratio 
1:2 and —5:2 respectively. 


(i) P divides AB in the ratio 1 : 2

    ⇒   P is the 'point of balance' of 2 at A and 1 at B
    ⇒   p = (2a + 1b)/(2 + 1) = ⅔a + ⅓b.

(ii) Q divides AB in the ratio 5 : −2

    ⇒   Q is the 'point of balance' of −2 at A and 5 at B
    ⇒   q = (−2a + 5b)/(−2 + 5) = −⅔a + (5/3)b.


Exercise 11.1b 


1 The vertices of △ABC have position vectors a, b, c. Find, in terms of
a, b, c, the position vectors of
(i) D, E, F, the mid-points of BC, CA, AB respectively;
(ii) the point X which divides the line AD in the ratio 2 : 1, and the
corresponding points Y, Z dividing BE, CF in the same ratio.

Hence show that the medians AD, BE, CF are concurrent (at a point
called the centroid of △ABC).


2 The vertices A, B, C of △ABC have position vectors a, b, c. Find, in
terms of a, b and c, the position vectors of
(i) P, which divides BC in the ratio 3 : 2;
(ii) Q, which divides CA in the ratio 1 : 3;
(iii) R, which divides AB in the ratio 2 : 1;
(iv) X, which divides AP in the ratio 5 : 1;
(v) Y, which divides BQ in the ratio 2 : 1;
(vi) Z, which divides CR in the ratio 1 : 1.
Hence show that AP, BQ, CR are concurrent.


3 In no. 2 find also the position vectors of

(i) U, which divides BC in the ratio −3 : 2;

(ii) V, which divides CA in the ratio −1 : 3;

(iii) W, which divides AB in the ratio −2 : 1.
Hence show that:

U also divides QR in the ratio −3 : 4;

V also divides RP in the ratio −5 : 3;

and W also divides PQ in the ratio −4 : 5.


4 If the points A, B, C have position vectors a, b, c from an origin O,
show that the equation

    r = ta + (1 − t)b,

where t is a parameter, represents the straight line AB, and find the
equation of BC.

Find the equation of the straight line joining L, the mid-point of OA, to M,
the mid-point of BC. Find also the position vector of the point in which the
line LM meets the straight line joining the mid-point of OB to the
mid-point of AC.


5 OABC is a square of side 2a. i, j are unit vectors along OA, OC. The
mid-point of AB is L; the mid-point of BC is M; OL, AM meet at P; BP
meets OA at N. Show that the segment OP can be measured by the vector
λ(2ai + aj) and also by the vector 2ai + μ(2aj − ai). Hence determine λ
and μ. Prove that ON = ⅔OA. (O.C.)


11.2 Centres of mass 


In the last section, when masses α and β were placed at two points whose
position vectors are a and b, we referred to

    p = (αa + βb)/(α + β)

as their 'point of balance'.
Let us now suppose that we have a number of masses:

    m₁, m₂, m₃, m₄, ...

at  r₁, r₂, r₃, r₄, ... respectively.

[Figure: masses m₁, m₂, m₃, ... with position vectors drawn from an origin O]

Let G be the point which has position vector r̄, where

    r̄ = (m₁r₁ + m₂r₂ + m₃r₃ + m₄r₄ + ···)/(m₁ + m₂ + m₃ + m₄ + ···).

Now G is a definite point, but there is a serious risk that its position will
depend on our choice of origin, O.

To discover whether or not this is so, we must take another origin O' and
see whether such a calculation based on O' leads to a different point G'.
Let us suppose that from the new origin O', O has position vector c, and
m₁, m₂, ... have position vectors r₁', r₂', ....

Then    r₁' = c + r₁, etc.

So the position vector of G', calculated from O', is

    r̄' = (m₁r₁' + m₂r₂' + m₃r₃' + ···)/(m₁ + m₂ + m₃ + ···)

       = c(m₁ + m₂ + m₃ + ···)/(m₁ + m₂ + m₃ + ···)
           + (m₁r₁ + m₂r₂ + m₃r₃ + ···)/(m₁ + m₂ + m₃ + ···)

       = c + r̄.

So      r̄' = c + r̄.

Hence   O'G' = O'O + OG = O'G
    ⇒   G and G' are identical.

So if masses m₁, m₂, m₃, ...

are at points r₁, r₂, r₃, ...,

the point   r̄ = (m₁r₁ + m₂r₂ + m₃r₃ + ···)/(m₁ + m₂ + m₃ + ···)

is not dependent on our choice of origin, but only on the given system of
masses, and is therefore called their centre of mass. In the particular case
when the masses are all equal this becomes

    r̄ = (1/n)(r₁ + r₂ + ··· + rₙ),

which is also known as the centroid of the given points. So the centroid of
r₁ and r₂ is at ½(r₁ + r₂), and of r₁, r₂, r₃ is at ⅓(r₁ + r₂ + r₃).
Finally, in terms of components, the centre of mass is

    x̄i + ȳj + z̄k = [(m₁x₁ + m₂x₂ + ···)/(m₁ + m₂ + ···)] i
                    + [(m₁y₁ + m₂y₂ + ···)/(m₁ + m₂ + ···)] j
                    + [(m₁z₁ + m₂z₂ + ···)/(m₁ + m₂ + ···)] k.

So the centre of mass has coordinates

    x̄ = (m₁x₁ + m₂x₂ + ···)/(m₁ + m₂ + ···)
    ȳ = (m₁y₁ + m₂y₂ + ···)/(m₁ + m₂ + ···)      or   (Σmx/Σm, Σmy/Σm, Σmz/Σm).
    z̄ = (m₁z₁ + m₂z₂ + ···)/(m₁ + m₂ + ···)
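
The independence of the origin can be confirmed on a small numerical case. The Python sketch below is an illustration only: the function name centre_of_mass, the three masses and the shift c are our own sample data, not taken from the text.

```python
def centre_of_mass(masses, points):
    """Return (sum of m*r)/(sum of m), computed component by component."""
    total = sum(masses)
    dim = len(points[0])
    return tuple(
        sum(m * p[k] for m, p in zip(masses, points)) / total
        for k in range(dim)
    )

masses = [1, 2, 3]
points = [(4, 3, 0), (4, 0, 2), (0, 3, 0)]   # positions measured from O
g = centre_of_mass(masses, points)

# Change of origin: from O', the old origin O has position vector c,
# so every position vector becomes r' = c + r.
c = (1, -2, 5)
shifted = [tuple(ci + pi for ci, pi in zip(c, p)) for p in points]
g_new = centre_of_mass(masses, shifted)

# The argument above predicts g_new = c + g, i.e. the same physical point.
expected = tuple(ci + gi for ci, gi in zip(c, g))
print(g, g_new, expected)
```

The two computed positions differ only by the vector c, which is exactly the statement that G and G' coincide.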


Example 1 


A light rectangular frame of length 4 m, breadth 3 m and height 2 m has
masses 1 kg, 2 kg, 3 kg attached at corners as shown in the figure. Where
is their centre of mass?

Taking unit vectors i, j, k as shown, the three masses are:

    1 kg at the point 4i + 3j
    2 kg at the point 4i + 2k
    3 kg at the point 3j.

So their centre of mass is at

    r̄ = [1 × (4i + 3j) + 2 × (4i + 2k) + 3 × 3j]/(1 + 2 + 3)
      = (12i + 12j + 4k)/6
      = 2i + 2j + ⅔k.

Alternatively, taking axes Ox, Oy, Oz in the same directions as i, j, k,
the calculation could be set out:

    m    x    y    z    mx    my    mz

    1    4    3    0     4     3     0
    2    4    0    2     8     0     4
    3    0    3    0     0     9     0

    Σm = 6         Σmx = 12   Σmy = 12   Σmz = 4

So the centre of mass has coordinates

    x̄ = Σmx/Σm = 2,   ȳ = Σmy/Σm = 2,   z̄ = Σmz/Σm = ⅔,

and is therefore at the point (2, 2, ⅔).


Example 2 


If a square of side a is removed from the corner of a uniform square lamina
of side 2a, where is the centre of mass of the remaining piece?

We can regard the remaining piece as being composed of a large square of
mass 4 together with a smaller square of negative mass −1. So, using
axes Ox, Oy as shown, we have:

                    m     x     y     mx     my
    Large square    4     a     a     4a     4a
    Small square   −1    ½a    ½a    −½a    −½a
    Final lamina   Σm = 3            Σmx = 7a/2   Σmy = 7a/2

So      x̄ = Σmx/Σm = 7a/6,   ȳ = Σmy/Σm = 7a/6,

and the centre of mass lies at (7a/6, 7a/6).


Exercise 11.2a


1 If the vertices of a tetrahedron ABCD have position vectors a, b, c, d,
find the position vectors of
(i) the centroid P of △BCD;
(ii) the point dividing AP in the ratio 3 : 1. What do you deduce about
this and the three other similar lines?
(iii) the mid-point of the line joining the mid-points of AB and CD.
Hence find seven lines associated with the tetrahedron which are all
concurrent. At what point do they meet?


2 If G, G' are the centroids of △ABC, △A'B'C', prove that
    AA' + BB' + CC' = 3GG'.


3 Find the centres of mass of
(i) 1 kg at a, 2 kg at b, 3 kg at c;

(ii) 1 g at (0, 0, 0), 4 g at (1, 4, 2), 3 g at (3, −1, −2), 2 g at (−4, 1, 6);
(iii) 1 kg at i + 2j, 4 kg at j − 2k, 5 kg at −i + 3j + k.

4 If a cube of side 1 m is removed from the corner of a uniform solid
cube of side 2 m, how far has its centre of mass been displaced?


5 A mass of 2 kg is placed at the point (4, 1). Find the masses which 
should be placed at (—1, 3) and (—2, —1) in order that the centroid of 
the three masses should be at the origin. (S.M.P.) 


Continuous distributions of mass 


So far, we have limited our investigations to systems consisting of a finite 
number of discrete masses. When there is a continuous distribution of mass, 
the method is very similar, except that we shall usually begin by dividing 
the given body into a series of elements.

Example 2


Uniform triangular lamina 


We first divide the lamina into a set of elemental strips parallel to one of its 
sides. Now the centre of mass of each strip is at its mid-point, and these are 
all collinear on a median. Hence the centre of mass of the lamina lies on 
this median, and similarly on the other two medians. It is therefore situated 
at the centroid of the given triangle. 


Example 3 


Find the centre of mass of: 
(i) A uniform lamina (or plate) in the shape of the area between y = x²,
y = 0 and x = 1.

(ii) A uniform solid formed by rotating this area about the x-axis.


(i) Uniform lamina

[Figure: the area under y = x² from x = 0 to x = 1, with a typical strip of width δx and height y]

Suppose that the lamina has surface density (i.e., mass per unit area) σ,
and that it is split into narrow strips of width δx and height y.

Then the strip shown has mass m = σy δx which can be regarded as
concentrated at the point (x, ½y).

So      x̄ = Σmx/Σm = Σ(σy δx)x / Σ(σy δx) = Σxy δx / Σy δx

    ⇒   x̄ = ∫₀¹ xy dx / ∫₀¹ y dx = ∫₀¹ x³ dx / ∫₀¹ x² dx = (1/4)/(1/3) = 3/4,

and     ȳ = Σm(½y)/Σm = Σ(σy δx)(½y) / Σ(σy δx) = Σ½y² δx / Σy δx

    ⇒   ȳ = ∫₀¹ ½y² dx / ∫₀¹ y dx = ∫₀¹ ½x⁴ dx / ∫₀¹ x² dx = (1/10)/(1/3) = 3/10.

So the centre of mass is at (3/4, 3/10).


(ii) Uniform solid

Suppose that the solid has volume density ρ and that it is sliced into thin
discs of thickness δx and radius y.

Then the mass of the disc shown = m = ρ(πy² δx) = ρπy² δx.
Now if the centre of mass is at (x̄, ȳ), we see by symmetry that ȳ = 0.

Also    x̄ = Σmx/Σm = Σ(ρπy² δx)x / Σρπy² δx = Σxy² δx / Σy² δx

    ⇒   x̄ = ∫₀¹ xy² dx / ∫₀¹ y² dx = ∫₀¹ x⁵ dx / ∫₀¹ x⁴ dx = (1/6)/(1/5) = 5/6.

So the centre of mass is at (5/6, 0).
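
The passage from the sums over strips to the integrals can be imitated on a computer by taking a large number of narrow strips. The following Python sketch is an illustration only (the helper name midpoint_sum and the choice of 10 000 strips are our own assumptions); it approximates each Σ … δx by evaluating the integrand at the mid-point of every strip.

```python
def midpoint_sum(g, x0, x1, n=10000):
    """Approximate the sum of g(x)*dx over n strips between x0 and x1."""
    dx = (x1 - x0) / n
    return sum(g(x0 + (k + 0.5) * dx) for k in range(n)) * dx

def y(x):
    """The boundary curve y = x^2."""
    return x ** 2

# (i) Lamina: each strip has mass proportional to y*dx, centred at (x, y/2).
area = midpoint_sum(y, 0, 1)
x_bar_lamina = midpoint_sum(lambda x: x * y(x), 0, 1) / area
y_bar_lamina = midpoint_sum(lambda x: 0.5 * y(x) ** 2, 0, 1) / area

# (ii) Solid of revolution: each disc has mass proportional to y^2*dx.
volume = midpoint_sum(lambda x: y(x) ** 2, 0, 1)
x_bar_solid = midpoint_sum(lambda x: x * y(x) ** 2, 0, 1) / volume

print(x_bar_lamina, y_bar_lamina, x_bar_solid)
```

The constant factors σ, ρ and π are omitted because they cancel between numerator and denominator, just as in the working above. The three values should be close to 3/4, 3/10 and 5/6.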
Example 4
Find the centre of mass of:

(i) a uniform semi-circular wire of radius a;
(ii) a uniform semi-circular plate of radius a.


(i) Semi-circular wire

Take an axis Ox along the line of symmetry and consider a small section
of wire which subtends angle δθ at the centre. The length of this section is
a δθ, so if its line density is ρ it will have mass m = ρa δθ.

Also the length of the wire is πa, so its total mass will be ρπa.

By symmetry, ȳ = 0

and     x̄ = Σmx/Σm = Σ(ρa δθ)(a cos θ)/(ρπa) = (a/π) Σ cos θ δθ

    ⇒   x̄ = (a/π) ∫ cos θ dθ (from −½π to ½π) = (a/π) [sin θ] (from −½π to ½π) = 2a/π.

So the centre of mass of the wire is at (2a/π, 0), i.e., approximately at
(0.64a, 0).


(ii) Semi-circular plate

Method 1

Suppose that the plate is divided into strips of length 2y and width δx
and that its surface density is σ.
Then the mass of strip = σ(2y δx) and the total mass of the plate
= ½πa²σ.

So      x̄ = Σmx/Σm = Σ(2σy δx)x/(½πa²σ) = (4/πa²) Σxy δx

    ⇒   x̄ = (4/πa²) ∫₀ᵃ xy dx = (4/πa²) ∫₀ᵃ x√(a² − x²) dx

          = (4/πa²) [−⅓(a² − x²)^(3/2)]₀ᵃ

          = (4/πa²) [0 − (−⅓a³)] = 4a/3π.

Also, by symmetry, ȳ = 0.

So the centre of mass is at (4a/3π, 0); or, approximately, (0.42a, 0).


Method 2

This last result can be obtained more easily if we make use of the previous
result for a semi-circular wire. For the semi-circular plate can be dissected
into a large number of sectors and each of these sectors is approximately
a triangle with its centre of mass at a distance 2a/3 from O.

[Figure: semi-circular plate of radius a divided into narrow sectors]

In this way we see that the uniform plate of radius a has the same centre
of mass as a uniform wire of radius 2a/3.

Hence   x̄ = 2(2a/3)/π = 4a/3π.
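
Both results, and the agreement between Methods 1 and 2, can be checked by summing over many small elements. The Python sketch below is an illustration only: the helper midpoint_sum, the radius a = 1 and the number of elements are our own assumptions.

```python
import math

def midpoint_sum(g, x0, x1, n=100000):
    """Approximate the sum of g(x)*dx over n elements between x0 and x1."""
    dx = (x1 - x0) / n
    return sum(g(x0 + (k + 0.5) * dx) for k in range(n)) * dx

a = 1.0  # radius (sample value)

# Wire: element of length a*dtheta at distance a*cos(theta) along Ox;
# total length pi*a. The line density cancels.
x_bar_wire = midpoint_sum(lambda t: a * (a * math.cos(t)),
                          -math.pi / 2, math.pi / 2) / (math.pi * a)

# Plate (Method 1): strip of length 2y and width dx at distance x,
# with y = sqrt(a^2 - x^2); total area (1/2)*pi*a^2. The surface density cancels.
x_bar_plate = midpoint_sum(lambda x: 2 * math.sqrt(a * a - x * x) * x,
                           0, a) / (0.5 * math.pi * a * a)

# Method 2: the plate behaves like a wire of radius 2a/3.
x_bar_method2 = 2 * (2 * a / 3) / math.pi

print(x_bar_wire, x_bar_plate, x_bar_method2)
```

The first value should be close to 2a/π ≈ 0.64a, and the second and third should agree with each other at about 0.42a.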


Exercise 11.2b 


1 Find the coordinates of the centres of mass of uniform laminae bounded 


by: 

(2) ψ- 9 y = 0, x= 3 

(ἢ) yor, y=0, χεὶ, χΞ 2; 
(11) γ' = 4x, χ =9; 

(iv) ψ τ Ts y =0, x=h 


2 Find the coordinates of the centres of mass of the uniform solids formed 
by rotating the areas of no. 1 through 360° about the x-axis. 


3 By rotating a quadrant of the curve y = √(a² − x²) about the x-axis,
find the position of the centre of mass of a uniform solid hemisphere.


4 By slicing a uniform tetrahedron parallel to one of its faces, show that
its centre of mass lies on the line joining the centroid of this face to the
opposite vertex. Hence show that the centre of mass

(i) of a uniform tetrahedron lies at the centroid of its four vertices;

(ii) of a uniform cone (or pyramid) is at one-quarter of its height above
the base (cf. no. 2);

(iii) of a uniform hemispherical shell is half-way from its centre to the
middle-point of the shell (use the result of no. 3 and part (ii)).


5 A sphere of radius a is cut into two portions by a plane which is distant
½a from the centre of the sphere. Show by integration that the volume of
the smaller of the two portions is 5πa³/24.

Calculate the distance of the centre of gravity of this smaller portion
from the centre of the sphere. (O.C.)


6 Use Simpson’s rule to find the approximate value of the x-coordinate 
of the centroid of the area bounded by the x-axis, the line x = 4, and a 
curve joining the points 

x 0 1 2 3 4 

y 0 1.05 2.21 3.50 4.92. (0.¢.) 


11.3 Scalar products 


So far we have confined ourselves to the addition and subtraction of vectors
and to their multiplication by a scalar. We now move to two other operations
which can be performed on a pair of vectors, and obtain their scalar
product and vector product. These are both exceedingly important throughout
applied mathematics and particularly for describing (respectively) the
work done by a force and the moment of a force. In this section we shall
concentrate upon the scalar product.

If a and b are two vectors with magnitudes a and b whose directions are
separated by an angle θ, we define their scalar product as

    a.b = ab cos θ.

[Figure: vectors a and b separated by the angle θ]

As cos(−θ) = cos θ we need not be concerned about the direction in
which to measure θ, and it is also apparent that

    a.b = a(b cos θ) = a × projection of b on a

        = b(a cos θ) = b × projection of a on b.

As an illustration, if F is a force and c is a displacement of its point of
application, the scalar product F.c is called the work done by the force in
this displacement.

So the work can be regarded as

    (projection of force F in direction of displacement c)
        × (magnitude of displacement),
or  (projection of displacement c in direction of force F)
        × (magnitude of force);

and if F and c are perpendicular, then F.c = 0, so that no work is done
when a force is displaced perpendicular to its line of action.

A number of results follow immediately from the definition of the scalar
product:

(i) a.a = aa cos 0 = a².
a.a is usually abbreviated as a². So a² = a², where on the left a is the
vector and on the right a is its magnitude.

(ii) a.b = b.a
(i.e., scalar product is commutative).

(iii) If a and b are non-zero vectors,

    a.b = 0   ⇔   ab cos θ = 0   ⇔   θ = ½π.

So  a.b = 0   ⇔   a and b are perpendicular.

(iv) a.(λb) = a(λb) cos θ = λab cos θ
     λ(a.b) = λ(ab cos θ) = λab cos θ.

So  a.(λb) = λ(a.b)

i.e., the scalar product is distributive with respect to multiplication by a scalar.

(This last result, like some of its predecessors and also the next, at first
appears trivial. But it is worthwhile pausing to appreciate its importance,
and to be grateful that when we are confronted with a.(2b), (2a).b and
2(a.b), we know without hesitation that they are all equal.)

(v) (a + b).c = a.c + b.c,

i.e., scalar product is distributive with respect to addition.

This result, too, is clear from a figure.

For (a + b).c = (projection of a + b on c) × c
              = (projection of a on c + projection of b on c) × c
              = (projection of a on c) × c
                  + (projection of b on c) × c
              = a.c + b.c.

So  (a + b).c = a.c + b.c

and, similarly,   a.(b + c) = a.b + a.c.

This result is particularly important in mechanics. For if P and Q are
two forces and c is a certain displacement, then

    (P + Q).c = P.c + Q.c.

So, for any displacement, the work done by the resultant of two forces is
the sum of the amounts of work done by the forces separately.
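
The statement about the work done by a resultant can be tried on particular numbers. The Python sketch below is an illustration only: it assumes the familiar component formula for the scalar product (the sum of products of corresponding components), and the forces P, Q and the displacement c are our own sample values.

```python
def dot(a, b):
    """Scalar product computed from components."""
    return sum(x * y for x, y in zip(a, b))

def add(a, b):
    """Vector sum computed from components."""
    return tuple(x + y for x, y in zip(a, b))

P = (3, 0, -1)   # first force (sample values)
Q = (1, 2, 4)    # second force (sample values)
c = (2, -1, 5)   # displacement (sample values)

work_resultant = dot(add(P, Q), c)      # (P + Q).c
work_separate = dot(P, c) + dot(Q, c)   # P.c + Q.c
print(work_resultant, work_separate)
```

Both quantities come to the same number, as result (v) requires.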
Example 1

If two pairs of opposite edges of a tetrahedron are perpendicular, so is the
third.

Take one of the vertices as the origin O, and let the position vectors of
the others be a, b, c.

Suppose that OB ⊥ CA and OC ⊥ AB.
Then    b.(a − c) = 0   and   c.(b − a) = 0
    ⇒   b.a − b.c = 0   and   c.b − c.a = 0
    ⇒   a.b = b.c   and   b.c = a.c
    ⇒   a.b = a.c
    ⇒   a.(b − c) = 0
    ⇒   OA ⊥ BC.

Finally, we see that in the particular case when the four points are
coplanar, each is the orthocentre of the triangle formed by the other three.


Exercise 11.3a


1 Letting a and b be the position vectors of two points A and B and
expanding the scalar product (a − b)², prove the cosine rule for △OAB.


2 Taking O as the centre of a circle, points with position vectors a, −a at
opposite ends of a diameter, and r anywhere else on the circle, prove that
the angle in a semi-circle must be a right angle.


3 Let O be the middle point of the base BC of △ABC and let A, B, C have
position vectors a, b, −b. By expanding (a − b)² + (a + b)², prove
Apollonius' theorem, that

    AB² + AC² = 2(AO² + BO²).


4 AB, CD are two skew (non-coplanar) lines which have their mid-points
at P, Q respectively. Prove that

    AC² + AD² + BC² + BD² = AB² + CD² + 4PQ².


5 A point P lies in the plane of a triangle ABC, and G is the centroid of
this triangle. Prove that

    PA² + PB² + PC² = 3PG² + ⅓(BC² + CA² + AB²).

What is the least value of PA² + PB² + PC² as P varies in the plane?
What is the locus of P when P varies so that

    PA² + PB² + PC² is constant? (O.C.)


Scalar product in component form 

Suppose that a and b are expressed in terms of unit vectors i, j and k:

    a = a₁i + a₂j + a₃k
and b = b₁i + b₂j + b₃k.

We can readily find a.b in terms of these components.

Firstly we note that i, j, k are unit vectors, so i.i = j.j = k.k = 1; and
i, j, k are perpendicular, so i.j = i.k = 0 etc.

Now     a.b = (a₁i + a₂j + a₃k).(b₁i + b₂j + b₃k)

    ⇒   a.b = a₁b₁ + a₂b₂ + a₃b₃.

In particular,   a² = a.a = a₁² + a₂² + a₃²
and              a = √(a₁² + a₂² + a₃²).

Also, if a = li + mj + nk is a unit vector which makes angles θ, φ, ψ
with the three axes, then

    cos θ = a.i = l
    cos φ = a.j = m
    cos ψ = a.k = n,

so that l, m, n are known as the direction cosines of a.
These results are frequently of great use in finding the angle between two
directions in space.


Example 2 


If three corners A, B, C of a crystal are known to be at points (3, 2, 1),
(4, 0, 2), (5, 2, 4) respectively, we may wish to find the angle θ between
AB and AC.

[Figure: triangle with vertices A(3, 2, 1), B(4, 0, 2), C(5, 2, 4)]

Taking unit vectors i, j, k in the direction of the three axes,

    a = AB = i − 2j + k
and b = AC = 2i + 3k.

Then    a² = 1² + 2² + 1² = 6    ⇒   a = √6
and     b² = 2² + 0² + 3² = 13   ⇒   b = √13.
Also    a.b = 1 × 2 + (−2) × 0 + 1 × 3 = 5.

Now     a.b = ab cos θ.

So      5 = √6 × √13 cos θ
    ⇒   cos θ = 5/√78 ≈ 0.566
    ⇒   θ = 55° 31'.

So AB and AC are inclined at 55° 31'.
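
The same calculation is easily mechanised. The Python sketch below is an illustration only (the helper names sub, dot and norm are our own); it forms AB and AC from the three given points and converts the angle to degrees and minutes.

```python
import math

def sub(p, q):
    """Displacement vector from q to p."""
    return tuple(x - y for x, y in zip(p, q))

def dot(a, b):
    """Scalar product from components."""
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    """Magnitude of a vector."""
    return math.sqrt(dot(a, a))

A, B, C = (3, 2, 1), (4, 0, 2), (5, 2, 4)
ab = sub(B, A)   # AB = i - 2j + k
ac = sub(C, A)   # AC = 2i + 3k

cos_theta = dot(ab, ac) / (norm(ab) * norm(ac))
theta = math.degrees(math.acos(cos_theta))

degrees = int(theta)
minutes = round((theta - degrees) * 60)
print(f"{degrees} deg {minutes} min")
```

The scalar product is 5 and the moduli are √6 and √13, exactly as in the worked solution.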


Exercise 11.3b 


1 Find the scalar products of the following pairs of vectors and hence the 
angle between them: 

(1) t+ y and 24 2); 

(ἢ) t+g7+k and i+7—k; 

(1) --ἰ --- 2] --ἰΚ and ἃ -- 7 ἰ Κ. 


2 In the above rectangular box, OA = 4, OB = 3, OC = 2. Furthermore,
P is the mid-point of CE; Q divides EG in the ratio 2 : 1; and R is the centre
of rectangle BFGD.

For each of the following pairs of lines, find vectors which represent the
two lines (in terms of i, j, k) and hence the angle between the lines
(i) OP and OD;
(ii) AQ and AB;
(iii) ER and RC;
(iv) PR and QF.


3 Find whether the line through (1,0,2) and (4,3, —1) meets the 
line through (0,3, —1) and (5, —2,4). Calculate the (acute) angle 
between their directions. (S.M.P.) 


4 The rectangular cartesian coordinates of two points A and B are (1.9) 
t 
and (— 1,4) respectively. Show that the column vector is always normal 


to AB and hence, or otherwise, find the coordinates of the foot of the per- 
pendicular from the origin to AB. (ἃ:} 
5 Lines l, m have respectively the parametric equations


x 2 0 x —2 ] 
z —] 3 Ζ ] —] 


A is the point on l with parameter t₁, B the point on m with parameter u₁.
Write down an expression for the vector AB.

Given that the line AB is perpendicular to both l and m, find the values of
t₁ and u₁, and show that the length of AB is 7/√5 units. (S.M.P.)


*11.4 Coordinate geometry of three dimensions 


The equations of a line

Suppose that P₀ is a point whose position vector is r₀ and that a straight
line passes through P₀ in the direction of the vector u. Then if P, with
position vector r, lies on this line, it follows that

$\mathbf{r} = \mathbf{r}_0 + \lambda\mathbf{u}$ (where λ is a scalar).

Suppose now that the components of
r are (x, y, z), r₀ are (x₀, y₀, z₀), and u are (l, m, n).

Then $x = x_0 + \lambda l, \quad y = y_0 + \lambda m, \quad z = z_0 + \lambda n,$

which are called the equations of the line.


Example 1 


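Find the vector and Cartesian equations of the line joining the points
(1, 2, 3) and (2, 3, 5).

In this case, $\mathbf{r}_0 = \mathbf{i} + 2\mathbf{j} + 3\mathbf{k}$
and $\mathbf{u} = \mathbf{i} + \mathbf{j} + 2\mathbf{k}.$

So $\mathbf{r} = (\mathbf{i} + 2\mathbf{j} + 3\mathbf{k}) + \lambda(\mathbf{i} + \mathbf{j} + 2\mathbf{k})$

$\Rightarrow \mathbf{r} = (1 + \lambda)\mathbf{i} + (2 + \lambda)\mathbf{j} + (3 + 2\lambda)\mathbf{k},$

and in Cartesian form:

$x = 1 + \lambda, \quad y = 2 + \lambda, \quad z = 3 + 2\lambda$

$\Rightarrow \frac{x - 1}{1} = \frac{y - 2}{1} = \frac{z - 3}{2} \; (= \lambda).$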
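The parametric form r = r₀ + λu lends itself to direct computation. The short Python sketch below is illustrative only; the function names `direction` and `point_on_line` are our own assumptions. It generates points of the line through (1, 2, 3) and (2, 3, 5) for chosen values of λ.

```python
def direction(p, q):
    """Direction vector u of the line joining points p and q."""
    return tuple(b - a for a, b in zip(p, q))

def point_on_line(r0, u, lam):
    """Position vector r = r0 + lam * u of a point on the line."""
    return tuple(p + lam * d for p, d in zip(r0, u))

P, Q = (1, 2, 3), (2, 3, 5)
u = direction(P, Q)

# lam = 0 gives P itself, lam = 1 gives Q, and other values give
# further points of the same line.
for lam in (0, 1, 2):
    print(lam, point_on_line(P, u, lam))
```

Each printed point satisfies x − 1 = y − 2 = (z − 3)/2, which is the Cartesian form of the same line.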


Example 2 
Find the angle between the lines 
te ped, 228 x yt ll. 242 


and --  - = 


1 2 8 9 8 4 


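These are in the directions of the vectors
i + 2j + 3k and 2i + 3j + 4k.
Now, if the angle between these vectors is θ, then

$(\mathbf{i} + 2\mathbf{j} + 3\mathbf{k}).(2\mathbf{i} + 3\mathbf{j} + 4\mathbf{k}) = \sqrt{14} \times \sqrt{29}\,\cos\theta$

$\Rightarrow 20 = \sqrt{406}\,\cos\theta$

$\Rightarrow \cos\theta = \frac{20}{\sqrt{406}} \approx 0.9926$

$\Rightarrow \theta \approx 6^\circ\,59'.$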

The equation of a plane

Suppose that P (with position vector r) lies in a plane which passes
through a fixed point P₀ (whose position vector is r₀), and that u, v are
two non-parallel vectors in the plane.

Then $\mathbf{r} = \mathbf{r}_0 + \lambda\mathbf{u} + \mu\mathbf{v}$ (where λ, μ are scalars), which is therefore
the vector equation of the plane.

Suppose further that the normal to the plane is in the direction n.

Then $\mathbf{n}.(\mathbf{r} - \mathbf{r}_0) = 0$

$\Rightarrow \mathbf{n}.\mathbf{r} = \mathbf{n}.\mathbf{r}_0.$

If we now suppose that
$\mathbf{n} = a\mathbf{i} + b\mathbf{j} + c\mathbf{k}$, it follows that
$(a\mathbf{i} + b\mathbf{j} + c\mathbf{k}).(x\mathbf{i} + y\mathbf{j} + z\mathbf{k}) = \mathbf{n}.\mathbf{r}_0;$

and if we let $\mathbf{n}.\mathbf{r}_0 = -d$, we see that

$ax + by + cz + d = 0,$

which is the Cartesian equation of the plane.


Example 3 


Find the Cartesian equation of the plane through the point (1, 2, 3) which 
is perpendicular to the line 


eae Mage Oe ee, 
ὃ- τ- - -ᾧἢ 


The direction of this line is given by the vector 3i + 2j + k, and the
vector from (1, 2, 3) to (x, y, z) is

$\mathbf{r} = (x - 1)\mathbf{i} + (y - 2)\mathbf{j} + (z - 3)\mathbf{k}.$

As these must be perpendicular, it follows that

$3(x - 1) + 2(y - 2) + (z - 3) = 0$
$\Rightarrow 3x + 2y + z - 10 = 0.$
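The step from n.(r − r₀) = 0 to ax + by + cz + d = 0 can be sketched in a few lines of Python. This is an illustration under our own naming (`plane_through` is an assumed helper, not part of the text); it returns the coefficients (a, b, c, d), with d = −n.r₀.

```python
def plane_through(point, normal):
    """Coefficients (a, b, c, d) of ax + by + cz + d = 0 for the plane
    through `point` with normal vector `normal`, using d = -n.r0."""
    a, b, c = normal
    d = -sum(n * p for n, p in zip(normal, point))
    return (a, b, c, d)

def lies_in_plane(coeffs, point):
    """True if `point` satisfies ax + by + cz + d = 0 exactly."""
    a, b, c, d = coeffs
    x, y, z = point
    return a * x + b * y + c * z + d == 0

# Plane through (1, 2, 3) perpendicular to the direction 3i + 2j + k.
plane = plane_through((1, 2, 3), (3, 2, 1))
print(plane)
print(lies_in_plane(plane, (1, 2, 3)))
```

Integer inputs keep the arithmetic exact here; with measured (floating-point) data the equality test would need a tolerance.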


Exercise 11.4a 


1 Find, in both vector and Cartesian forms, the equations of:
(i) the line joining (0, 1, 2) and (2, 1, 0);
(ii) the line joining (0, 1, 2) and (−2, −1, 0).

2 Find the equations of:

(i) the plane which passes through (1, 2, 3) and is perpendicular to the
line (x − 1)/2 = y + 1 = z/3;

(ii) the plane which passes through (0, 1, 2) and is perpendicular to
i + 2j + 3k;

(iii) the line through (1, 1, 2) which is perpendicular to 2x + y + z = 4.

3 Find the angles between the lines

(i) r = (1 + λ)i + (1 − λ)j + (2 + λ)k

and r = (1 − μ)i + (1 − 2μ)j + (1 + μ)k;

(ii) Pe ee ἘΞ πὸ 


— and — 


2 ] 3 3 2 ] 
Angles between planes and lines


We have already seen how to calculate the angle between two lines in 
space. We now proceed to calculate the angle between two planes; and, 
finally, the angle between a line and a plane. 


The angle between two planes can be defined as the angle between two lines 
in the planes both of which are perpendicular to the line of intersection 
of the planes. Hence it must also be the angle between the normals to the 
two planes. 


Example 4 
Find the angle between the planes
3x + 2y + z = 0 and x + 2y + 4 = 0.
The directions of the normals to these planes are given by
3i + 2j + k and i + 2j,
and the angle between these two vectors is θ, where

$\cos\theta = \frac{(3\mathbf{i} + 2\mathbf{j} + \mathbf{k}).(\mathbf{i} + 2\mathbf{j})}{\sqrt{14} \times \sqrt{5}} = \frac{7}{\sqrt{70}} = \sqrt{0.7} = 0.8367$

$\Rightarrow \theta = 33^\circ\,12'.$


The angle between a line and a plane is defined as the angle between the
line (l) and its projection (p) on the plane (π).

Hence it is conveniently calculated by finding the angle between the
line and the normal n to the plane, and then taking its complement.


Example 5
Find the angle between the line


Cd Wf 2) 2a 
9 3 4 


and the plane
4x + 3y + 2z + 1 = 0.

Now the line is in the direction of the vector 2i + 3j + 4k and the
normal to the plane is in the direction 4i + 3j + 2k.
So the angle θ between these directions is given by

$\cos\theta = \frac{(2\mathbf{i} + 3\mathbf{j} + 4\mathbf{k}).(4\mathbf{i} + 3\mathbf{j} + 2\mathbf{k})}{\sqrt{29} \times \sqrt{29}} = \frac{25}{29} = 0.8621$

$\Rightarrow \theta = 30^\circ\,27'.$

Hence the required angle is

90° − 30° 27′ = 59° 33′.
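Both kinds of angle in this section reduce to the angle between two vectors. The Python sketch below is only an illustration (the function names are our own assumptions): the angle between two planes is taken as the angle between their normals, and the angle between a line and a plane as the complement of the angle between the line and the normal.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def angle_between(a, b):
    """Angle in degrees between the vectors a and b."""
    cos_theta = dot(a, b) / math.sqrt(dot(a, a) * dot(b, b))
    return math.degrees(math.acos(cos_theta))

def angle_between_planes(n1, n2):
    """Angle between two planes = angle between their normals n1, n2."""
    return angle_between(n1, n2)

def angle_line_plane(u, n):
    """Angle between a line (direction u) and a plane (normal n):
    the complement of the angle between u and n."""
    return 90.0 - angle_between(u, n)

# Normals 3i + 2j + k and i + 2j of the two planes considered above.
planes_angle = angle_between_planes((3, 2, 1), (1, 2, 0))

# Line with direction 2i + 3j + 4k and plane with normal 4i + 3j + 2k.
line_plane_angle = angle_line_plane((2, 3, 4), (4, 3, 2))

print(round(planes_angle, 2), round(line_plane_angle, 2))
```

The sketch assumes the acute case, as in the worked examples; if the scalar product were negative one would take its absolute value before finding the angle.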


The distance from a point to a plane 


Find the perpendicular distance from the point (x₁, y₁, z₁) to the plane
ax + by + cz + d = 0.

Let $\mathbf{r}_1 = x_1\mathbf{i} + y_1\mathbf{j} + z_1\mathbf{k}$
$\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}$
$\mathbf{n} = a\mathbf{i} + b\mathbf{j} + c\mathbf{k}.$

Then n.r = ax + by + cz = −d = constant, so that the projection
of r in the direction of n is a constant.

Hence n is normal to the plane. But the unit vector in this direction is
n/n (where n is the modulus of n); so if the required perpendicular distance
is p, it follows that the foot of the perpendicular from P₁ to the plane is

$\mathbf{r}_1 \pm p\,\frac{\mathbf{n}}{n}$ (the sign depending on the direction of n).

But this point lies in the plane.

So $\mathbf{n}.\left(\mathbf{r}_1 \pm p\,\frac{\mathbf{n}}{n}\right) + d = 0$

$\Rightarrow \mathbf{n}.\mathbf{r}_1 \pm pn + d = 0$

$\Rightarrow p = \frac{|\mathbf{n}.\mathbf{r}_1 + d|}{n}.$

Now $\mathbf{n}.\mathbf{r}_1 = ax_1 + by_1 + cz_1$

and $n = \sqrt{a^2 + b^2 + c^2}.$

So $p = \frac{|ax_1 + by_1 + cz_1 + d|}{\sqrt{a^2 + b^2 + c^2}}.$

In exactly similar fashion in a plane, the perpendicular distance from a
point P₁ (x₁, y₁) to the line ax + by + c = 0 is given by

$\frac{|ax_1 + by_1 + c|}{\sqrt{a^2 + b^2}}.$

So, for example, the distance of the point (1, 2) from the line
3x + 4y + 1 = 0 is

$\frac{|3 \times 1 + 4 \times 2 + 1|}{\sqrt{3^2 + 4^2}} = \frac{12}{5} = 2.4.$
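Since the formula has the same shape in two and three dimensions, a single routine serves for both. The following Python sketch is an illustration only; the name `distance_to` and its argument order are our own assumptions.

```python
import math

def distance_to(coeffs, constant, point):
    """Perpendicular distance from `point` to the line or plane
    n.r + constant = 0, where `coeffs` are the components of n:
    p = |n.r1 + constant| / |n|."""
    n_dot_r1 = sum(c * x for c, x in zip(coeffs, point))
    modulus = math.sqrt(sum(c * c for c in coeffs))
    return abs(n_dot_r1 + constant) / modulus

# The point (1, 2) and the line 3x + 4y + 1 = 0.
d_line = distance_to((3, 4), 1, (1, 2))

# A three-dimensional case: the origin and the plane 3x + 2y + z - 10 = 0.
d_plane = distance_to((3, 2, 1), -10, (0, 0, 0))

print(d_line, round(d_plane, 4))
```

The first value reproduces the 2.4 found above; the second is 10/√14, the distance of the origin from the plane.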


Exercise 11.4b 


1 Find pairs of vectors which are normal to the following planes, and
hence the angles between them:

(i) x + 2y + 3z = 6 and x − y − z = 4;

(ii) x − y − 3 = 0 and y + z − 1 = 0.

2 Find the angles between the following planes and lines: 

χα ΠΟΘ ice 


| 2 an 


(4) 3x + 2y+2z= 1 and 


x —_— 

1 2y—z=3 d — =>——_ = 
MO) Χ ΞΕ 2y <2 an Ὁ ri 
3 (i) Calculate the perpendicular distances from (2, 1) to the lines (in
the plane z = 0):

3x + 4y + 5 = 0 and 5x − 12y + 15 = 0.

(ii) Find the locus of a point P (x, y) which is equidistant from these two
lines. Describe this locus.


4 (i) Calculate the perpendicular distances from (1, 2, 3) to the planes
x + 2y + 2z + 7 = 0 and 2x + y − 2z − 1 = 0.

(ii) Find the locus of a point P (x, y, z) which is equidistant from these two
planes. Describe this locus.


5 A cube ABCDA′B′C′D′ has base ABCD and vertical sides AA′ etc. The
mid-points of the edges AB, AD, AA′ are respectively P, Q, R. The cube
is cut along the plane PQR.
(i) Find the angle the edge AA′ makes with the plane PQR.
(ii) Find the angle the plane PQR makes with each face of the cube.
(iii) Determine the ratio of the volume of the tetrahedron APQR to that
of the remainder of the cube.

If the cube is also cut along the plane AB′C, draw a diagram of the
triangle PQR and mark in the line of this second cut. (S.M.P.)


6 Find the perpendicular distance of the point (3,0, 1) from the line 
whose Cartesian equation is 
ae ee ace eee 
,ϑι 4 1 

(First find the general point P of this line and then the condition for the
vector from (3, 0, 1) to P to be perpendicular to the line.) (M.E.I.)


7 Find the perpendicular (i.e., shortest) distance between the two skew 
lines: 

r= (ὃ -- 3+ 3k) + λ(--2δὲ + 7 — 2k) 

r= (30 --Ἠ 7 Ὁ 2k) + μ(2ὲ — 37 — 2k). 

8 Find the equation of the plane through A = (2, 2, —1) perpendicular 


to OA. Which point on this plane is closest to the point (2, —1, 2)? 
(S.M.P.) 
*11.5 Vector products


If a and b are two vectors, we have seen that we can associate with them a 
scalar quantity, known as their scalar product. We now ask ourselves 
whether there is also a convenient vector product which we could usefully 
consider. 


It is clear that the two vectors a and b fix a particular parallelogram, whose
area is easily seen to be ab sin θ. Furthermore, the most natural direction
to associate with this parallelogram would be the one which is perpendicular,
or normal, to its plane. As for the particular sense (up or down this
line), we can easily resolve our dilemma by following the movement of a
right-handed screw which is turning from a to b.

We therefore define the vector product, which we shall write† a × b, as
the vector
whose magnitude is ab sin θ;
whose direction is perpendicular to a and b;
and whose sense is determined by a right-hand screw turning from a to b.

† Also denoted by a ∧ b.

The most common use of vector products occurs in mechanics, when a
force F acts through a point whose position vector is r; the moment of the
force about O is then defined as r × F.

In the figure it is seen that the magnitude of this moment is rF sin θ = Fp,
and so is equal to the product of the force and the length of the perpendicular
arm from O. The moment itself is a vector of this magnitude which
acts perpendicular to the plane of r and F. In our example, therefore, by
the right-hand screw rule, this moment acts into the paper and represents
a turning effect which is in the plane of the paper and clockwise.

As with the scalar product, the properties of the vector product follow
from its definition. Some are expected, like

$\mathbf{a} \times (\lambda\mathbf{b}) = (\lambda\mathbf{a}) \times \mathbf{b} = \lambda(\mathbf{a} \times \mathbf{b})$

and can easily be demonstrated in a figure. Others are more surprising,
like $\mathbf{a} \times \mathbf{a} = \mathbf{0}$ (since aa sin 0 = 0)

and $\mathbf{a} \times \mathbf{b} = -\mathbf{b} \times \mathbf{a}$

(because a right-hand screw turning from a to b moves against a right-hand
screw turning from b to a). So vector products are not commutative.

Lastly, however, vector products (like scalar products) are distributive
with respect to addition:

$\mathbf{a} \times (\mathbf{b} + \mathbf{c}) = \mathbf{a} \times \mathbf{b} + \mathbf{a} \times \mathbf{c}.$

We shall not establish this until the end of this section, though in passing
we remark on its importance if P and Q are two forces.

For the result means that

$\mathbf{r} \times (\mathbf{P} + \mathbf{Q}) = \mathbf{r} \times \mathbf{P} + \mathbf{r} \times \mathbf{Q},$

i.e. that the sum of the moments of two forces is equal to the moment of
their resultant.

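Vector product in component form

If $\mathbf{a} = a_1\mathbf{i} + a_2\mathbf{j} + a_3\mathbf{k}$

and $\mathbf{b} = b_1\mathbf{i} + b_2\mathbf{j} + b_3\mathbf{k},$

we can easily find the vector product a × b.

For, firstly, we notice that
$\mathbf{i} \times \mathbf{i} = \mathbf{j} \times \mathbf{j} = \mathbf{k} \times \mathbf{k} = \mathbf{0}$
and that $\mathbf{i} \times \mathbf{j} = \mathbf{k} = -\mathbf{j} \times \mathbf{i}$
$\mathbf{j} \times \mathbf{k} = \mathbf{i} = -\mathbf{k} \times \mathbf{j}$
$\mathbf{k} \times \mathbf{i} = \mathbf{j} = -\mathbf{i} \times \mathbf{k}.$

So $\mathbf{a} \times \mathbf{b} = (a_1\mathbf{i} + a_2\mathbf{j} + a_3\mathbf{k}) \times (b_1\mathbf{i} + b_2\mathbf{j} + b_3\mathbf{k})$
$= (a_2b_3 - a_3b_2)\mathbf{i} + (a_3b_1 - a_1b_3)\mathbf{j} + (a_1b_2 - a_2b_1)\mathbf{k}.$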
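The component formula translates directly into code. The Python sketch below is an illustration only (the names `cross` and `dot` are our own assumptions); it also checks two of the properties noted earlier, namely that a × b = −b × a and that a × b is perpendicular to both a and b.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product in component form:
    (a2*b3 - a3*b2, a3*b1 - a1*b3, a1*b2 - a2*b1)."""
    a1, a2, a3 = a
    b1, b2, b3 = b
    return (a2 * b3 - a3 * b2, a3 * b1 - a1 * b3, a1 * b2 - a2 * b1)

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
print(cross(i, j), cross(j, k), cross(k, i))   # k, i, j in turn

# A sample pair of vectors, i + j and j + k.
b, c = (1, 1, 0), (0, 1, 1)
n = cross(b, c)
print(n)
print(cross(c, b))            # the same vector with its sign reversed
print(dot(n, b), dot(n, c))   # both zero: n is perpendicular to b and c
```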


Example 1 


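The position vectors of three points A, B, C are a = i, b = i + j, c = j + k.
Find the angles between:

(i) the planes OBC and OAC;

(ii) the line OB and the plane ABC.

(i) Two vectors in plane OBC are b and c, so the normal to the plane is
along

$\mathbf{b} \times \mathbf{c} = (\mathbf{i} + \mathbf{j}) \times (\mathbf{j} + \mathbf{k})$
$= \mathbf{i} \times \mathbf{j} + \mathbf{i} \times \mathbf{k} + \mathbf{j} \times \mathbf{j} + \mathbf{j} \times \mathbf{k}$
$= \mathbf{k} - \mathbf{j} + \mathbf{0} + \mathbf{i} = \mathbf{i} - \mathbf{j} + \mathbf{k};$

and similarly the normal to OAC is along

$\mathbf{a} \times \mathbf{c} = \mathbf{i} \times (\mathbf{j} + \mathbf{k}) = \mathbf{k} - \mathbf{j} = -\mathbf{j} + \mathbf{k}.$

Now the angle θ between two planes is the angle between their normals, so

$\cos\theta = \frac{(\mathbf{i} - \mathbf{j} + \mathbf{k}).(-\mathbf{j} + \mathbf{k})}{\sqrt{3} \times \sqrt{2}} = \frac{2}{\sqrt{6}} = \frac{\sqrt{2}}{\sqrt{3}}$

$\Rightarrow \theta = 35^\circ\,16'.$

(ii) Similarly, two vectors in plane ABC
are $\mathbf{AB} = \mathbf{b} - \mathbf{a} = \mathbf{j}$
and $\mathbf{AC} = \mathbf{c} - \mathbf{a} = -\mathbf{i} + \mathbf{j} + \mathbf{k}.$

So the normal to the plane is along

$\mathbf{j} \times (-\mathbf{i} + \mathbf{j} + \mathbf{k}) = \mathbf{i} + \mathbf{k}.$

Now $\mathbf{OB} = \mathbf{b} = \mathbf{i} + \mathbf{j}.$
Hence the angle between OB and this normal is φ, where

$\cos\phi = \frac{(\mathbf{i} + \mathbf{j}).(\mathbf{i} + \mathbf{k})}{\sqrt{2} \times \sqrt{2}} = \frac{1}{2}$

$\Rightarrow \phi = 60^\circ.$

So the angle between OB and the plane itself is 90° − 60° = 30°.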


Scalar and vector triple products

We now proceed to ask if there is any way in which we can speak of the
product of three vectors a, b, c.

Firstly, it is clear that the expression a.(b.c) is meaningless, since b.c
is not a vector but a scalar and so cannot form a scalar product with a.
By contrast, however, a.(b × c) does exist and, as it is a scalar quantity,
is called a scalar triple product. Furthermore, a × (b × c) also exists and,
as it is a vector quantity, is called a vector triple product.

Though these quantities are extremely important in applied mathematics,
we cannot investigate them further and simply note the connection
between scalar triple products and volumes.

Suppose that a parallelepiped is formed from the three vectors a, b, c, as
shown in the figure.

Then b × c is a vector of magnitude bc sin θ, which is equal to the
area A of the base parallelogram, in a direction normal to the base.

So a.(b × c) is of magnitude
(projection of a perpendicular to b and c) × A
= perpendicular height × area of base
= volume of parallelepiped.

Hence volume of parallelepiped = a.(b × c)

and it clearly follows that:

volume of parallelepiped is zero ⇔ a, b, c are coplanar ⇔ a.(b × c) = 0.

Furthermore, in components:

$\mathbf{a} = a_1\mathbf{i} + a_2\mathbf{j} + a_3\mathbf{k}$
$\mathbf{b} \times \mathbf{c} = (b_2c_3 - b_3c_2)\mathbf{i} + (b_3c_1 - b_1c_3)\mathbf{j} + (b_1c_2 - b_2c_1)\mathbf{k}$
$\Rightarrow \mathbf{a}.(\mathbf{b} \times \mathbf{c}) = a_1(b_2c_3 - b_3c_2) + a_2(b_3c_1 - b_1c_3) + a_3(b_1c_2 - b_2c_1).$

So a, b, c are coplanar
$\Leftrightarrow a_1(b_2c_3 - b_3c_2) + a_2(b_3c_1 - b_1c_3) + a_3(b_1c_2 - b_2c_1) = 0,$

which is a result to which we shall return in chapter 13.
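The coplanarity test is a convenient one to mechanise. The Python sketch below is an illustration only; `triple` is our own name for the scalar triple product, and the sample vectors are chosen simply for the purpose of the check.

```python
def cross(a, b):
    a1, a2, a3 = a
    b1, b2, b3 = b
    return (a2 * b3 - a3 * b2, a3 * b1 - a1 * b3, a1 * b2 - a2 * b1)

def triple(a, b, c):
    """Scalar triple product a.(b x c); its magnitude is the volume of
    the parallelepiped formed from a, b, c."""
    return sum(x * y for x, y in zip(a, cross(b, c)))

def coplanar(a, b, c):
    """a, b, c are coplanar exactly when a.(b x c) = 0."""
    return triple(a, b, c) == 0

# The unit cube has volume 1.
print(triple((1, 0, 0), (0, 1, 0), (0, 0, 1)))

# Here the third vector is the sum of the first two, so the three
# vectors must lie in one plane and the volume collapses to zero.
p, q = (1, 2, 3), (2, 3, 4)
s = tuple(x + y for x, y in zip(p, q))
print(triple(p, q, s), coplanar(p, q, s))
```

With integer components the test for zero is exact; for floating-point data a small tolerance would be needed instead.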


Exercise 11.5
1 If a = i − j, b = i + j + k, c = k, calculate b × c, c × a, a × b.
2 Denoting corresponding points in no. 1 by capital letters, find:
(i) vectors which are normal to the planes OBC, OCA, ABC;
(ii) the angle between the planes OBC and OCA;
(iii) the angles between these planes and the plane ABC;
(iv) the angles between these planes and the line AB.

3 In no. 1 find whether:
(i) a.(b × c) = b.(c × a) = c.(a × b);
(ii) a × (b × c) = (a × b) × c;
(iii) a × (b × c) = (a.c)b − (a.b)c.
Test these results with other vectors a, b, c.
4 Let the three sides of a triangle ABC have vectors a, b, c so that
a + b + c = 0.
Hence prove
(i) b × c = c × a = a × b;
(ii) the sine formula for △ABC.


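* The distributive law for vector products†

$\mathbf{a} \times (\mathbf{b} + \mathbf{c}) = \mathbf{a} \times \mathbf{b} + \mathbf{a} \times \mathbf{c}.$

† Or, more strictly, that the operation of vector multiplication distributes over that of vector
addition.

Firstly, we project b and c on to the plane perpendicular to a, and let
their projections be b₁ and c₁.

If the angle between a and b is θ, then a × b has magnitude ab sin θ.

But a × b₁ has magnitude
$ab_1 = ab\cos(\tfrac{1}{2}\pi - \theta) = ab\sin\theta.$

Also, as a, b and b₁ are coplanar, a × b and a × b₁ both lie along the
perpendicular to this plane.

So $\mathbf{a} \times \mathbf{b} = \mathbf{a} \times \mathbf{b}_1;$
similarly $\mathbf{a} \times \mathbf{c} = \mathbf{a} \times \mathbf{c}_1,$ (1)
and $\mathbf{a} \times (\mathbf{b} + \mathbf{c}) = \mathbf{a} \times (\mathbf{b}_1 + \mathbf{c}_1).$

We now concentrate on the plane perpendicular to a, of which we can
take a bird's eye view.

Now a × b₁ has magnitude ab₁
and a × c₁ has magnitude ac₁.
So the vectors

$\mathbf{a} \times \mathbf{b}_1, \quad \mathbf{a} \times \mathbf{c}_1, \quad \mathbf{a} \times (\mathbf{b}_1 + \mathbf{c}_1)$

are formed by first multiplying

$\mathbf{b}_1, \quad \mathbf{c}_1, \quad \mathbf{b}_1 + \mathbf{c}_1$

by a and then rotating through a right angle.
But it is clear that

$\mathbf{a} \times (\mathbf{b}_1 + \mathbf{c}_1)$

is the diagonal of the parallelogram formed by
a × b₁ and a × c₁.

So $\mathbf{a} \times (\mathbf{b}_1 + \mathbf{c}_1) = \mathbf{a} \times \mathbf{b}_1 + \mathbf{a} \times \mathbf{c}_1,$
from which it follows by (1) that
$\mathbf{a} \times (\mathbf{b} + \mathbf{c}) = \mathbf{a} \times \mathbf{b} + \mathbf{a} \times \mathbf{c}.$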
*11.6 Plane kinematics


If a point P is moving in a plane, then we have already seen that (relative
to a fixed point O) its position vector r, velocity v and acceleration a can
all be expressed in component form:

$\mathbf{r} = x\mathbf{i} + y\mathbf{j},$
$\mathbf{v} = \dot{\mathbf{r}} = \dot{x}\mathbf{i} + \dot{y}\mathbf{j},$
$\mathbf{a} = \ddot{\mathbf{r}} = \ddot{x}\mathbf{i} + \ddot{y}\mathbf{j}.$

These, of course, are based on Cartesian axes Ox, Oy, and the question 
now arises whether we can possibly obtain similar results using polar 
coordinates. 

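Let us start by supposing that at time t the point P has (relative to a fixed
point O and a fixed base-line) position vector r and polar coordinates
(r, θ), where r, θ are functions of t.

We now let p and q be unit vectors along and perpendicular to OP; so
that as P moves, p and q change direction, rotating with angular velocity $\dot\theta$.

We immediately see that

$\dot{\mathbf{p}} = \dot\theta\,\mathbf{q}$ and $\dot{\mathbf{q}} = -\dot\theta\,\mathbf{p}.$

But $\mathbf{r} = r\mathbf{p}.$

So $\dot{\mathbf{r}} = \dot{r}\mathbf{p} + r\dot{\mathbf{p}} = \dot{r}\mathbf{p} + r\dot\theta\,\mathbf{q}.$

Furthermore, $\ddot{\mathbf{r}} = \ddot{r}\mathbf{p} + \dot{r}\dot{\mathbf{p}} + \dot{r}\dot\theta\,\mathbf{q} + r\ddot\theta\,\mathbf{q} + r\dot\theta\,\dot{\mathbf{q}}$
$= \ddot{r}\mathbf{p} + \dot{r}\dot\theta\,\mathbf{q} + \dot{r}\dot\theta\,\mathbf{q} + r\ddot\theta\,\mathbf{q} - r\dot\theta^2\mathbf{p}$
$= (\ddot{r} - r\dot\theta^2)\mathbf{p} + (r\ddot\theta + 2\dot{r}\dot\theta)\mathbf{q}.$

Summarising,

$\dot{\mathbf{r}} = \dot{r}\mathbf{p} + r\dot\theta\,\mathbf{q}$
$\ddot{\mathbf{r}} = (\ddot{r} - r\dot\theta^2)\mathbf{p} + (r\ddot\theta + 2\dot{r}\dot\theta)\mathbf{q}$

Hence the radial and transverse components of the velocity and acceleration
of P can be denoted:

Velocity: radial $\dot{r}$, transverse $r\dot\theta$.
Acceleration: radial $\ddot{r} - r\dot\theta^2$, transverse $r\ddot\theta + 2\dot{r}\dot\theta$.

In the particular case when P is moving in a circle of radius a, we have
r = a and $\dot{r} = \ddot{r} = 0$, so that the velocity is $a\dot\theta$ along the tangent and
the acceleration has radial component $-a\dot\theta^2$ and transverse component $a\ddot\theta$.

Furthermore, if P is moving with constant angular velocity $\dot\theta$ it follows
that $\ddot\theta = 0$, so that the acceleration of P is again shown to be $a\dot\theta^2$ directed
towards O.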
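The radial and transverse formulae can be checked numerically against the Cartesian ones. The Python sketch below is an illustration only: the function names are our own assumptions, and the spiral r = 1 + t, θ = t² is a hypothetical path chosen purely as a test case. The Cartesian acceleration is estimated by central differences and then resolved along p and q.

```python
import math

def polar_velocity(r, r_dot, theta_dot):
    """(radial, transverse) components of velocity."""
    return (r_dot, r * theta_dot)

def polar_acceleration(r, r_dot, r_ddot, theta_dot, theta_ddot):
    """(radial, transverse) components of acceleration."""
    return (r_ddot - r * theta_dot ** 2,
            r * theta_ddot + 2 * r_dot * theta_dot)

# Circle of radius a described with constant angular velocity omega.
a, omega = 2.0, 3.0
circle_acc = polar_acceleration(a, 0.0, 0.0, omega, 0.0)

# Hypothetical spiral r = 1 + t, theta = t**2, examined at t = 1.
def position(t):
    r, theta = 1.0 + t, t * t
    return (r * math.cos(theta), r * math.sin(theta))

def cartesian_acceleration(t, h=1e-4):
    """Central-difference estimate of (x'', y'')."""
    (x0, y0), (x1, y1), (x2, y2) = position(t - h), position(t), position(t + h)
    return ((x0 - 2 * x1 + x2) / h ** 2, (y0 - 2 * y1 + y2) / h ** 2)

t = 1.0
theta = t * t
ax, ay = cartesian_acceleration(t)
radial_fd = ax * math.cos(theta) + ay * math.sin(theta)        # along p
transverse_fd = -ax * math.sin(theta) + ay * math.cos(theta)   # along q
spiral_acc = polar_acceleration(1.0 + t, 1.0, 0.0, 2 * t, 2.0)

print(circle_acc)
print(spiral_acc, (round(radial_fd, 3), round(transverse_fd, 3)))
```

For the circle the sketch returns a purely radial acceleration of magnitude aω² directed inwards, in agreement with the result above.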




Example 3 (see also 8.1) 


In 1611 John Kepler announced that each planet moves so that a line
from the Sun 'sweeps out' area at a constant rate (i.e., so that equal areas
are swept in equal times). Deduce that the planet's acceleration must be
along this line.

If we suppose that in time δt the planet moves from P to P′ and the
additional area swept out is δA, then with the usual notation,

$\delta A \approx \tfrac{1}{2}r^2\,\delta\theta$

$\Rightarrow \frac{\delta A}{\delta t} \approx \tfrac{1}{2}r^2\,\frac{\delta\theta}{\delta t}$

$\Rightarrow \frac{dA}{dt} = \tfrac{1}{2}r^2\dot\theta.$

So if dA/dt is constant, it follows that

$\frac{d}{dt}(r^2\dot\theta) = 0$

$\Rightarrow r^2\ddot\theta + 2r\dot{r}\dot\theta = 0$

$\Rightarrow r\ddot\theta + 2\dot{r}\dot\theta = 0.$

So the transverse component of the acceleration is zero, and the acceleration
must be entirely radial. (It was this discovery, although not expressed in
this notation, that halted men's search for transverse forces propelling the
planets, and caused Newton to propose forces towards the Sun holding
them in orbit.)


Exercise 11.6 
1 The polar coordinates r, θ of a particle after time t are given by
$r = e^{2t}, \quad \theta = 2t.$

(i) Sketch its path, and state its polar equation.

(ii) Find the radial and transverse components of velocity and acceleration
after time t.

(iii) Show that its direction of motion is always inclined at 45° to the
radius and that its radial acceleration is zero.


2 A ship sails away from a lighthouse keeping the light on its port quarter at
a constant angle α.

Show that: (i) $\frac{r\dot\theta}{\dot{r}} = \tan\alpha$; (ii) $\frac{d\theta}{dr} = \frac{\tan\alpha}{r}$;

(iii) if r = a when θ = 0, then $r = ae^{\theta\cot\alpha}$. Sketch this path.


3 A bead is placed on a straight smooth horizontal wire which is rotating
about its end O with constant angular velocity ω, so that θ = ωt. As there
is no radial force on the bead, its radial acceleration must be zero. Hence
show that

(i) $\ddot{r} - \omega^2 r = 0$, and $r = Ae^{\omega t} + Be^{-\omega t}$;

(ii) if the bead is initially at rest at a distance a from O (i.e., t = 0, r = a
and $\dot{r} = 0$), then r = a cosh ωt;

(iii) $\dot{r}^2 = \omega^2(r^2 - a^2)$.


4 Describe the curve given by

$\mathbf{r} = \begin{pmatrix} 5\cos(t^2) \\ 5\sin(t^2) \end{pmatrix},$ where t denotes time.

Find a (= $\ddot{\mathbf{r}}$) and deduce the radial and transverse components of the
acceleration.
A boy slides down a banister such that his path is given by

$\mathbf{r} = \begin{pmatrix} 5\cos(t^2) \\ 5\sin(t^2) \\ kt^2 \end{pmatrix},$

where the units are metres and k is a constant. Indicate on a diagram the
shape of the banister and the magnitudes of the boy's acceleration components
in the directions (a) towards the axis of the curve, (b) vertically
downwards, (c) in a direction perpendicular to those in (a) and (b).

(Note. The unit vectors for r are horizontal south, horizontal west, and
vertically downwards.) (S.M.P.)


5 A bead is threaded on a smooth wire in the shape of a cycloid
x = a(θ − sin θ), y = a(1 + cos θ), which is fixed in a vertical plane with
the positive y-axis as the upward vertical. The bead is released from the
position given by θ = 0. Prove that, in the subsequent motion

(i) dθ/dt is constant;

(ii) the acceleration vector has constant magnitude. (O.C.)


11.7 Impulse and momentum 


Suppose that a particle of mass m is pulled in a horizontal line by a constant
force of magnitude F, and that in an interval of time t its speed changes
from v₀ to v₁ with an acceleration of magnitude a.

Since the acceleration is uniform,

$v_1 - v_0 = at.$

Also, by Newton's second law,

$F = ma.$

Hence $Ft = mv_1 - mv_0.$

The expression Ft is called the impulse of F during the interval t; and if,
further, the product of mass and velocity is called the momentum, we see
that mv₁ − mv₀ is the increase in momentum.

So Impulse of force during interval = Change of momentum

In SI, the unit of impulse Ft is 1 N s (and, since 1 N = 1 kg m s⁻², this is
the same as 1 kg m s⁻¹, which is also the unit of momentum, mv).


Example 1 


A stone of mass 2 kg is thrown vertically downwards from a high cliff with
velocity 4 m s⁻¹. Find the impulse of its weight in the first two seconds of
its fall and verify that this is equal to its increase in momentum.

Taking g = 9.8 m s⁻², the stone's weight is 19.6 N and the impulse of
this weight is 19.6 × 2 = 39.2 N s.

Also, its velocity after 2 s = 4 + 9.8 × 2 = 23.6 m s⁻¹.

So initial momentum = 2 × 4 = 8 N s,
final momentum = 2 × 23.6 = 47.2 N s,
and increase of momentum = 39.2 N s.


Impulse of a variable force 


It frequently happens when a sudden blow is struck (for instance when
a ball is hit by a cricket bat) that a very large force acts for a very small
interval of time. In such cases it is often either impossible or unnecessary to
investigate either the magnitude of the force or the duration of its action,
and attention is confined to the impulse of the force and the consequent
change of momentum. Moreover, the force will usually vary with time: the
thrust of the bat on ball increases very rapidly from zero (when contact is
first made at time t₀) to a maximum value, and then decreases just as
rapidly to zero (when the ball finally leaves the bat at time t₁).

In such cases the total impulse during the interval will be the limit of the
sum of small impulses F δt.

So impulse $= \lim \sum F\,\delta t = \int_{t_0}^{t_1} F\,dt.$

Moreover, at any instant $F = m\frac{dv}{dt}$

$\Rightarrow \int_{t_0}^{t_1} F\,dt = \int_{t_0}^{t_1} m\frac{dv}{dt}\,dt = \left[mv\right]_{t_0}^{t_1} = mv_1 - mv_0.$

So, again,

Impulse of force during interval = Change of momentum
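When the force is known only as a function of time, or as a table of readings, the integral for the impulse can be estimated numerically. The Python sketch below is an illustration only: it uses Simpson's rule, and the force law `blow`, its peak value, the contact time and the mass of the ball are all hypothetical figures of our own choosing.

```python
import math

def simpson(f, t0, t1, n=100):
    """Simpson's rule estimate of the integral of f from t0 to t1,
    using an even number n of equal strips."""
    if n % 2:
        raise ValueError("n must be even")
    h = (t1 - t0) / n
    total = f(t0) + f(t1)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(t0 + k * h)
    return total * h / 3

# A constant force of 19.6 N acting for 2 s: impulse = 19.6 * 2.
weight_impulse = simpson(lambda t: 19.6, 0.0, 2.0)

# Hypothetical blow: the force rises from zero to F_MAX and falls back
# to zero during a contact time T.
F_MAX, T = 1000.0, 0.01          # newtons, seconds (assumed values)

def blow(t):
    return F_MAX * math.sin(math.pi * t / T) ** 2

blow_impulse = simpson(blow, 0.0, T)   # exact value is F_MAX * T / 2
mass = 0.16                            # kg (assumed)
delta_v = blow_impulse / mass          # change of velocity, from impulse = m * delta_v

print(round(weight_impulse, 6), round(blow_impulse, 6), round(delta_v, 4))
```

The point of the second case is that neither the peak force nor the contact time need be known separately: only their combined effect, the impulse, determines the change of momentum.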


Exercise 11.7a 


1 Calculate the momentum (in N s) of:
(i) a car of mass 1 tonne travelling at 30 m s⁻¹;
(ii) a lorry of mass 10 tonne travelling at 10 m s⁻¹;
(iii) a pellet of mass 5 g moving at 150 m s⁻¹;
(iv) an electron of mass 9 × 10⁻³¹ kg moving at 3 × 10⁷ m s⁻¹.


2 Calculate (in SI units) the impulse of a force:
(i) 100 N acting for 3 s;

(ii) 6t N, from t = 0 to t = 2;

(iii) 4t² N, from t = 1 to t = 3;

(iv) 2 sin t N, from t = 0 to t = ½π.


3 A car of mass 1 tonne moves from rest under the action of a tractive
force and a resistance whose difference (the effective force) has a constant
value of 400 N. Find:

(i) the impulse of the effective force in the first 5 s;

(ii) the resulting momentum;

(iii) the resulting velocity.


4 Repeat no. 3 if the effective force diminishes steadily from 400 N to zero.

5 Repeat no. 3 if the effective force after time t is 16(5 − t)² N.


6 A lorry of mass 10 tonne is travelling at 30 m s⁻¹ when its brakes are
applied for a period of 2 s. If the braking force has the constant value
20 000 N, find:

(i) its impulse;

(ii) the final momentum;

(iii) the final velocity.


7 Repeat no. 6 if the braking force increases steadily to 20 000 N in the
2 s interval.


8 Repeat no. 6 if the braking force increases steadily from zero to 20 000 N
in one second and then decreases steadily to zero after another second.


9 A ball of mass 1 kg strikes the cushion of a billiard table perpendicularly
at a speed of 3 m s⁻¹ and rebounds at 1 m s⁻¹. What is its change of
momentum? If the total time of impact was 0.02 s and the force of the
cushion is supposed constant, find the value of this force.


10 Repeat no. 9 on the different supposition that the force of the cushion
increases steadily from zero and then decreases steadily to zero (all within
0.02 s). What is the maximum force?


11 A jet of water of radius 1 cm and speed 4 m s⁻¹ plays directly on to a
steel plate which completely destroys its momentum. Find:

(i) the loss of momentum of the water which strikes the plate in 1 s;

(ii) the force of the water on the plate.


12 A machine gun fires bullets of mass 30 g with muzzle velocity 500 m s⁻¹
and at a rate of 200 per minute. What average force must the gunner exert
to prevent recoil?


13 At a waterfall, a stream (which has no vertical velocity at the top of
the waterfall) falls 5 m vertically on to a horizontal rock without rebounding.
Given that 150 dm³ of water pass over the waterfall each second,
calculate the force exerted on the rock. (S.M.P.)


14 A horizontal jet of water delivering 5 kg s⁻¹ at a speed of 10 m s⁻¹
hits a vertical wall. Assuming that the water does not rebound, calculate
the force on the wall. If allowance were made for rebound, would this force
be increased or decreased? Give a reason for your answer. (C.)


15 A train of mass 1 000 tonne started from rest and the effective horizontal
force was recorded at intervals of ten seconds:

t/s         0     10    20    30    40    50    60
F/10^5 N    1.62  1.45  1.36  1.24  1.07  0.91  0.73

Find the total impulse of this force and hence the final momentum and
velocity of the train. (The use of Simpson's rule is recommended.)
Impulse and momentum as vectors

So far we have confined ourselves to the study of a body moving in a straight
line. We can now consider the more general case of a body of mass m moving
under the action of a variable force F.

Suppose that after time t the particle has velocity v and acceleration a;
and that when

t = t₀, v = v₀;
t = t₁, v = v₁.

By Newton's second law,

$\mathbf{F} = m\frac{d\mathbf{v}}{dt},$

so that we naturally write

$\int_{t_0}^{t_1} \mathbf{F}\,dt = \int_{t_0}^{t_1} m\frac{d\mathbf{v}}{dt}\,dt = \left[m\mathbf{v}\right]_{t_0}^{t_1} = m\mathbf{v}_1 - m\mathbf{v}_0.$

This is the first time that we have dared to speak of the integral of a
vector quantity, but success with the differentiation of vectors is a strong
incentive for such a step; and in any case, the expression

$\int_{t_0}^{t_1} \mathbf{F}\,dt$

can be regarded as shorthand for the vector whose components are

$\int_{t_0}^{t_1} X\,dt, \quad \int_{t_0}^{t_1} Y\,dt, \quad \int_{t_0}^{t_1} Z\,dt$

(where X, Y, Z are the components of F).
We therefore see that the impulse of a force during an interval of time,
like the momentum mv, is essentially a vector quantity; and, again, that

Impulse of force during interval = Change of momentum


Example 2

The 2 kg mass of Example 1 is now thrown horizontally with a speed of
4 m s⁻¹. Investigate the impulse of its weight and the change of its momentum
in the first two seconds of its flight.

Taking unit vectors i horizontally and j vertically downwards, we see
that
weight = 2 × 9.8j = 19.6j,
⇒ impulse = 19.6j × 2 = 39.2j.
Also initial velocity = 4i
and final velocity = 4i + 9.8 × 2j
= 4i + 19.6j.
So initial momentum = 4i × 2 = 8i
and final momentum = (4i + 19.6j) × 2
= 8i + 39.2j.
Hence change in momentum = 39.2j and we confirm that this is equal
to the impulse of the weight. This can also be summarised in a vector
triangle, in which the impulse I (of magnitude 39.2 N s) joins mv₀ to mv₁:

Impulse = I = mv₁ − mv₀

Example 3 


Water flowing in a horizontal pipe of radius 2 cm at a speed of 3 m s⁻¹ first
goes round a sharp right-angled bend and then issues as a jet perpendicular
to a fixed metal plate which destroys its momentum. Calculate:
(i) its thrust on the bend;
(ii) its thrust on the plate.

In 1 s the stretch of water coming out of the pipe has length 3 m and
volume

π × (0.02)² × 3 = 12π × 10⁻⁴ m³.

Now 1 m³ of water has mass 10³ kg. So the mass of this water is 1.2π kg,
and its momentum is

1.2π × 3 = 3.6π N s.

Now after going round the bend its momentum changes direction
through a right angle, and such a change in momentum must be caused by
an impulse J such that

J = mv₁ − mv₀.

Hence the impulse J acts into the bend, at 45° to the two sections of pipe,
and has magnitude

3.6π × √2 = 16 N s.

But this is the impulse, over an interval of 1 s, of a force of 16 N.

The thrust of the water on the pipe, therefore, is of magnitude 16 N
outwards from the bend at 135° with both sections.

When the jet strikes the metal plate, the momentum 3.6π N s is reduced
to zero and this therefore necessitates an impulse of 3.6π N s which is
caused by a thrust of 3.6π N acting for 1 second. So the thrust of the plate
on the jet, and hence of the jet on the plate, has magnitude 3.6π = 11.3 N.
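The numbers in this example can be reproduced by treating the momentum carried past a point in one second as a vector. The Python sketch below is an illustration only; the variable names and the choice of axes (x along the first section of pipe, y along the second) are our own assumptions.

```python
import math

RHO = 1000.0      # density of water, kg per cubic metre
radius = 0.02     # pipe radius, m
speed = 3.0       # speed of flow, m/s

# Mass of water passing any section in one second.
mass_per_second = RHO * math.pi * radius ** 2 * speed

# Momentum per second before and after the right-angled bend.
p_before = (mass_per_second * speed, 0.0)
p_after = (0.0, mass_per_second * speed)

# Impulse on the water in one second = change of momentum; a steady
# force of the same magnitude acting for 1 s produces it.
impulse = (p_after[0] - p_before[0], p_after[1] - p_before[1])
thrust_on_bend = math.hypot(*impulse)

# At the plate the whole momentum per second is destroyed.
thrust_on_plate = mass_per_second * speed

print(round(thrust_on_bend, 1), round(thrust_on_plate, 1))
```

The components of `impulse` are equal in size and opposite in sign along the two sections, which is the statement that the impulse makes 45° with each section of pipe.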


Exercise 11.7b 


1 Find (in SI) the impulse of:

(i) a force 6i + 2j + k, acting for 2 s;

(ii) a force −i − 3j + 2k, acting for 2 s;

(iii) a force i + (1 − t)j + 3t²k, from t = 0 to 2;
(iv) a force sin t i + t j, from t = 0 to ½π.


2 Find the final velocity of a mass of 2 kg which has initial velocity
4i + 3j + 2k and is acted upon (on separate occasions) by the forces
described in no. 1.


3 A body of mass 4 kg has initial velocity 4i + 2j − k. What constant
force must act upon it so as to produce:

(i) velocity 3i + 2j + 4k after 2 s;

(ii) velocity i + k after 5 s?


4 A sudden downpour of r cm of rain, of density ρ kg m⁻³, falls in one
hour on an area of A km² and f per cent of this water runs straight off the
ground into a river b m wide. What is the likely rise in level of the water
in the river if this flows at v m s⁻¹?

A containing bank turns the river-flow through a right angle. What is the
extra force on this bank due to the downpour? (M.E.I.)
Conservation of momentum


So far we have limited ourselves to the consideration of an impulse upon a
single body and its consequent change of momentum. But it is when a
number of bodies are connected or interacting that the concept of momentum
becomes most valuable.

Suppose that A and B are two particles and that the only forces acting
upon them are their mutual reactions R and −R which can (as when two
billiard balls clash) be changing both in magnitude and direction.

Over any interval of time,

$\int_{t_0}^{t_1} \mathbf{R}\,dt$ = change in momentum of A,

$\int_{t_0}^{t_1} -\mathbf{R}\,dt$ = change in momentum of B.

But $\int_{t_0}^{t_1} \mathbf{R}\,dt + \int_{t_0}^{t_1} -\mathbf{R}\,dt = \int_{t_0}^{t_1} \mathbf{0}\,dt = \mathbf{0}.$

So the total change in momentum is zero, and we have verified the
principle of conservation of momentum for a system which has no external
forces acting upon it.


Example 4 


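A mass of 2 kg moving with velocity $\mathbf{i} - 2\mathbf{j} + 3\mathbf{k}$ coalesces with a mass of
8 kg which is moving with velocity $-\mathbf{i} + 3\mathbf{j} + 6\mathbf{k}$. Find the velocity of
the combined mass of 10 kg.

If we let this final velocity be $\mathbf{v}$, we can apply the principle of conservation
of momentum and obtain:

$$10\mathbf{v} = 2(\mathbf{i} - 2\mathbf{j} + 3\mathbf{k}) + 8(-\mathbf{i} + 3\mathbf{j} + 6\mathbf{k}) = -6\mathbf{i} + 20\mathbf{j} + 54\mathbf{k}$$

$$\Rightarrow\quad \mathbf{v} = -0.6\mathbf{i} + 2\mathbf{j} + 5.4\mathbf{k}.$$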
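
The calculation in Example 4 can be checked numerically. The following is a minimal Python sketch, not part of the original text: the function name `combined_velocity` and the use of tuples for the $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$ components are illustrative assumptions. It adds the momenta of the separate masses and divides by the total mass.

```python
def combined_velocity(bodies):
    """Velocity of the single mass formed when several bodies coalesce.

    `bodies` is a list of (mass, (vx, vy, vz)) pairs. With no external
    forces, total momentum is conserved, so the final velocity is the
    total momentum divided by the total mass.
    """
    total_mass = sum(mass for mass, _ in bodies)
    momentum = [sum(mass * vel[axis] for mass, vel in bodies) for axis in range(3)]
    return tuple(p / total_mass for p in momentum)


# Data of Example 4: 2 kg at i - 2j + 3k and 8 kg at -i + 3j + 6k.
v = combined_velocity([(2, (1, -2, 3)), (8, (-1, 3, 6))])
print(v)  # (-0.6, 2.0, 5.4)
```

The same function accepts any number of bodies, so it could also be used to check answers of this type in the exercises.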


Example 5 


A shell of mass m is fired with velocity v at an angle α to the horizontal.
If the gun has mass M and recoils horizontally on smooth rails, find its
speed of recoil, V.




[Figure: impulses and momenta of the gun and the shell]


The forces acting for the short interval of the firing are:
(i) the weights Mg and mg, whose impulses over a very short interval are
negligible;
(ii) the very large explosive force between gun and shell which has equal
and opposite impulses I and −I on the gun and the shell;
(iii) the normal reaction between the rails and the gun which exerts a
vertical impulse J on the gun.

So the only external impulse is J, which has no horizontal component.
Hence the horizontal component of momentum must be conserved:

$$mv\cos\alpha - MV = 0 \quad\Rightarrow\quad V = \frac{mv\cos\alpha}{M}.$$
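
The recoil result $V = mv\cos\alpha/M$ can be evaluated for particular values. The sketch below is illustrative only: the function name `recoil_speed` and the sample figures (a 10 kg shell fired at 500 m s⁻¹ and 60° from a 2 000 kg gun) are assumptions, not data from the text.

```python
import math


def recoil_speed(shell_mass, shell_speed, elevation_deg, gun_mass):
    """Horizontal recoil speed of a gun on smooth rails.

    Horizontal momentum is conserved, so
    shell_mass * shell_speed * cos(elevation) = gun_mass * V.
    """
    horizontal_momentum = shell_mass * shell_speed * math.cos(math.radians(elevation_deg))
    return horizontal_momentum / gun_mass


# Hypothetical values chosen only for illustration.
print(recoil_speed(10, 500, 60, 2000))
```

With these sample figures the recoil speed comes out at about 1.25 m s⁻¹. Firing horizontally (α = 0) gives the largest recoil, and firing vertically gives none.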


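Exercise 11.7c


1 Find the velocity of the combined mass formed by the coalescing of:
(i) 1 kg moving at $4\mathbf{i} + 3\mathbf{j} - \mathbf{k}$,
and 2 kg moving at $\mathbf{i} + 3\mathbf{j} + 2\mathbf{k}$;

(ii) 3 kg moving at $\mathbf{i} + \mathbf{j} + \mathbf{k}$,
2 kg moving at $2\mathbf{i} + 3\mathbf{j} - 3\mathbf{k}$,
1 kg moving at $3\mathbf{i} + 2\mathbf{j} - 3\mathbf{k}$.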


1 


3 
2 A mass of 2 kg moving with velocity a ms ~ collides and combines 


Ι 
with a mass of 3 kg moving with velocity ( 4 ms’. 
Calculate the velocity of the single mass so formed. 
Find also the impulse which each mass has received. (S.M.P.) 


3 A rocket of mass 10 kg is moving with a velocity whose three components
(in m s⁻¹) are (200, 160, 4) and then separates into two parts. If the rear
portion has mass 8 kg and velocity (100, 100, 0), find the new velocity of
its front portion.


4 A bullet of mass 0.05 kg moving horizontally with velocity 321 m s⁻¹
strikes a stationary block of mass 16 kg which is free to slide without rotation
on a smooth horizontal plane. The bullet becomes embedded in the block
after 0.01 s. Calculate, in newtons, the resistance, assumed uniform, of
the block to penetration by the bullet.


5 A rocket of mass 40 tonnes (= 4 × 10⁴ kg) is mounted rigidly on a
trolley of mass 10 tonnes which is free to move horizontally. The rocket
ejects mass 1 tonne horizontally in a burst lasting 5 seconds, giving the
ejected matter a speed which may be taken as 2 km s⁻¹ relative to the
ground. Calculate the velocity of the rocket just after the ejection (1) neg- 
lecting resistances, and (11) assuming resistance of 4 x 107.N. 

If the same operation were carried out with the rocket mounted vertically
and free to lift off its mounting, show that, for a lift-off to occur, the duration
of the burst (assumed uniform) must not exceed a certain critical time, and
calculate this time. (The question of resistance does not arise in this
calculation. Explain why.) (Take g = 10 m s⁻².) (M.E.I.)


6 Two stones, each of mass 5m, are moving across a sheet of smooth ice at 
equal speeds of 10v in opposite directions on parallel paths, so that no 
collision is involved. A frog of mass m, travelling on one of the stones, leaps 
across to the other one, and in so doing deflects the stone he leaves through 
30° and changes its speed to 8v. 

Find, by drawing and measurement or by calculation, (i) through what
angle the other stone is deflected and its subsequent speed; (ii) the (vector)
impulse the frog exerts on the stone on which he lands. (S.M.P.)


Newton’s law of impact 


Suppose that two bodies of mass $m_1$, $m_2$ collide and that the components
of their velocities along the line of impact (i.e., along the common normal
at the point of contact) before the impact are $u_1$, $u_2$ and after the impact
are $v_1$, $v_2$.

[Figure: masses $m_1$, $m_2$ with velocities $u_1$, $u_2$ before the impact and $v_1$, $v_2$ after the impact]

Then Newton's experimental law of impact states that

$$v_1 - v_2 = -e(u_1 - u_2),$$

where e is a constant for the pair of bodies known as their resilience, or
coefficient of restitution.
Alternatively, this can be expressed as

$$v_2 - v_1 = e(u_1 - u_2),$$

that is, their relative velocity of separation is proportional to their relative
velocity of approach.

If the second body is at rest, for instance when it is a fixed hard floor on
to which the first body is dropped, it follows that $v_1 = -eu_1$; and it is
quickly seen that e can range from 0 (in the case of a piece of putty dropped
on to the floor) to nearly 1 (for a ball with a very good bounce, which is
said to be almost perfectly elastic).


So $0 \le e \le 1$.

Example 6


A ball is dropped from a height of 5 m on to a hard horizontal pavement.

If $e = \tfrac12$, calculate:

(i) the height to which it returns after its first bounce;

(ii) the total distance travelled after repeated bounces;

(iii) the total time taken.

(Take g = 10 m s⁻².)

(i) After falling from a height of 5 m the speed of the ball is

$$\sqrt{2 \times 10 \times 5} = 10 \text{ m s}^{-1},$$

and since $e = \tfrac12$, it rebounds with a speed of $\tfrac12 \times 10 = 5$ m s⁻¹.
Hence it will rebound to a height h, where

$$5^2 = 2 \times 10 \times h \quad\Rightarrow\quad h = 1\tfrac14 \text{ m}.$$

(ii) Just as the height of the first rebound is $\tfrac14 \times 5$ m, so, similarly, the height
of its second rebound is

$$\tfrac14 \times (\tfrac14 \times 5) = (\tfrac14)^2 \times 5 \text{ m}$$

and the height of its third rebound is

$$\tfrac14 \times (\tfrac14)^2 \times 5 = (\tfrac14)^3 \times 5 \text{ m, etc.}$$

Hence its total distance travelled (up and down) in making an infinite
number of bounces

$$= 5 + 2\{\tfrac14 \times 5 + (\tfrac14)^2 \times 5 + (\tfrac14)^3 \times 5 + \cdots\}$$

$$= 5 + 2 \times \tfrac14 \times 5\{1 + \tfrac14 + (\tfrac14)^2 + \cdots\}$$

$$= 5 + \tfrac52 \times \frac{1}{1 - \tfrac14} = 5 + \tfrac52 \times \tfrac43 = 8\tfrac13 \text{ m}.$$

(iii) The time taken to drop from 5 m = 1 s;

the time taken between first and second bounces $= 2 \times \tfrac{5}{10} = 1$ s;

the time taken between second and third bounces $= 2 \times \tfrac{5/2}{10} = \tfrac12$ s; etc.

Hence total time taken

$$= 1 + (1 + \tfrac12 + \tfrac14 + \cdots) = 1 + 2 = 3 \text{ s}.$$

So the ball comes to rest (after an infinite number of bounces) after 3 s and
travelling a total distance $8\tfrac13$ m.
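
The two infinite series in this example can be checked by summing a large but finite number of bounces. The sketch below is an illustration under stated assumptions: the function name `bounce_totals` and the cut-off of 200 bounces are choices made here, and the coefficient of restitution is taken as one half, the value that reproduces a total time of 3 s.

```python
import math


def bounce_totals(drop_height, e, g=10.0, bounces=200):
    """Total distance and time for a ball dropped on a hard floor.

    After each impact the speed is multiplied by e (Newton's law of
    impact with a fixed floor). Each rebound then contributes an up and
    down flight of height speed**2 / (2 g) and duration 2 * speed / g.
    """
    distance = drop_height
    time = math.sqrt(2 * drop_height / g)   # time of the first fall
    speed = math.sqrt(2 * g * drop_height)  # speed at the first impact
    for _ in range(bounces):
        speed *= e
        distance += 2 * speed ** 2 / (2 * g)
        time += 2 * speed / g
    return distance, time


distance, time = bounce_totals(5, 0.5)
print(round(distance, 3), round(time, 3))  # close to 8.333 and 3.0
```

Because the terms shrink geometrically, 200 bounces already agree with the limiting values to many decimal places. The closed forms $h(1 + e^2)/(1 - e^2)$ and $\sqrt{2h/g}\,(1 + e)/(1 - e)$ give the same results.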


Example 7 


A ball of mass 2 kg is moving with velocity 3 m s⁻¹ and collides directly
with another of mass 1 kg which is moving in the opposite direction with
velocity 5 m s⁻¹. If $e = \tfrac34$, find their velocities after impact.

[Figure: before impact the 2 kg ball moves at 3 m s⁻¹ and the 1 kg ball at 5 m s⁻¹ in the opposite direction; after impact their velocities are $v_1$ and $v_2$]

Let the final velocities be $v_1$, $v_2$ respectively. Then as there is no external
force acting on the system, momentum is conserved:

$$2v_1 + v_2 = 2 \times 3 + 1 \times (-5) \quad\Rightarrow\quad 2v_1 + v_2 = 1.$$

Also, by Newton's law of impact,

$$v_2 - v_1 = \tfrac34 \times (5 + 3) = 6.$$

So $2v_1 + v_2 = 1$ and $v_2 - v_1 = 6$

$$\Rightarrow\quad v_1 = -1\tfrac23, \quad v_2 = 4\tfrac13.$$

Hence the final velocities of the two balls are $-1\tfrac23$ m s⁻¹ and $4\tfrac13$ m s⁻¹.
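
The pair of simultaneous equations used here (conservation of momentum and Newton's law of impact) can be solved once and for all. The following Python sketch is illustrative: the function name `direct_impact` is hypothetical, and the value e = 3/4 used for the data of Example 7 is an assumption, chosen because it gives final velocities of $-1\tfrac23$ and $4\tfrac13$ m s⁻¹.

```python
def direct_impact(m1, u1, m2, u2, e):
    """Velocities after a direct impact along one line.

    Combines conservation of momentum,
        m1*v1 + m2*v2 = m1*u1 + m2*u2,
    with Newton's law of impact,
        v2 - v1 = e * (u1 - u2).
    Velocities are signed along the line of impact.
    """
    total_momentum = m1 * u1 + m2 * u2
    separation = e * (u1 - u2)  # this is v2 - v1
    v1 = (total_momentum - m2 * separation) / (m1 + m2)
    v2 = v1 + separation
    return v1, v2


# Example 7 data: 2 kg at +3 m/s meets 1 kg at -5 m/s, with e assumed to be 3/4.
v1, v2 = direct_impact(2, 3, 1, -5, 0.75)
print(v1, v2)
```

Setting e = 0 makes both bodies leave with the common velocity of a coalesced mass, and e = 1 gives the perfectly elastic case.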


Exercise 11.7d 


ΙΑ ball of mass 4 kg moving at 3 ms‘ catches up with another ball of 
mass 2 kg moving at 1 ms _' in the same straight line. Ife = 3, find their 
velocities after impact. 


2 A ball of mass 5 kg moving at 4 m s⁻¹ catches up with another ball of
mass 2 kg moving at 3 m s⁻¹ in the same straight line. If the second ball
has velocity 4 m s⁻¹ after impact find the coefficient of restitution and the
final velocity of the first ball.


4 Aball of mass 4 kg moving with speed 2 ms” * meets a ball of mass 2 kg 
moving in the opposite direction with speed 3 πὶ 5. ἡ. If e = 4, find the 
final velocities of the two balls. 


4 A ball of mass 3 kg moving at 5 ms ᾿ meets another ball of mass 6 kg 
moving in the opposite direction at 83 πὶ 5. ΄. If the impact brings the 
second ball to rest, find the final velocity of the first ball and the coefficient 
of restitution. 


5 Asmooth sphere A of mass 1 kg moves with speed 5 ms‘ on a smooth 
horizontal plane directly towards a vertical wall. Before it hits the wall it is 
struck from behind by a smooth sphere B with the same radius as A but of 
mass 2 kg moving towards the wall along the same line as A with speed 
10 πὶ 5. ‘. Given that the coefficient of restitution between the spheres is 3, 
find their new velocities. 
The sphere A later strikes the wall and rebounds, the coefficient of
restitution being ΣΦ. It then collides again with B which has been following 
it towards the wall. Show that, after this collision, both A and B are again 
moving towards the wall, the speed of A being three times that of B. 

(M.E.I.)


6 Three identical imperfectly elastic smooth spheres A, B, C are at rest 
in a straight line, but not in contact, on a smooth horizontal plane in the 
order named. A is given a velocity towards B. Prove that there will be at 
least three collisions during the subsequent motion. 

If the coefficient of restitution between each pair of spheres is >, deter- 
mine whether there is a fourth collision. (L.) 


7 The centres of three equal spheres A, B, C, at rest on a smooth horizontal
table, lie on a straight line. The coefficient of restitution between A
and B is e and between B and C is e′. The sphere A is projected to strike B
directly and B then strikes C. Show that there are no further collisions if
e′ < (3e − 1)/(1 + e). (W.)


8 A 'Supaball' when dropped on a hard floor displays a coefficient of
restitution, e, of very nearly unity, and e may be assumed to be independent
of the velocity of approach. When the ball is dropped vertically from a
height of 2 m on to a hard floor it comes finally to rest 26.0 s after being
released. What is e? What is the total distance that the ball has travelled?
[g = 9.8 m s⁻².] (C.S.)


11.8 Work, energy, and power 
Movement in a straight line 


Suppose that a mass m is pulled along a smooth horizontal table by a
constant force of magnitude F and that after moving a distance s its velocity
has increased from $v_0$ to $v_1$.

[Figure: mass m moving along the table, its velocity increasing from $v_0$ to $v_1$]

If the magnitude of the constant acceleration of m is a, then

$$v_1^2 = v_0^2 + 2as.$$

But, by Newton's second law,

$$F = ma \quad\Rightarrow\quad Fs = mas = \tfrac12 mv_1^2 - \tfrac12 mv_0^2.$$

The expression Fs is called the work done by F in the displacement s; and if,
further, the expression $\tfrac12 mv^2$ is called the kinetic energy of a mass m moving
with velocity v, we see that $\tfrac12 mv_1^2 - \tfrac12 mv_0^2$ is its increase in kinetic energy.
Hence

    Work done by the force in the displacement = Increase of kinetic energy


In SI, the unit of work (and therefore of energy) will be 1 N m, which is
called 1 joule (1 J).

Example 1 


A car of mass 800 kg on a horizontal road accelerates from a speed of
10 m s⁻¹ under the action of a tractive force of 500 N and against a
resistance of 100 N. Find its speed when it has travelled a distance of 20 m.


Work done by tractive force = 500 × 20 = 10 000 J.
Work done by resistance = −100 × 20 = −2 000 J.
Hence total work = 8 000 J.
If the final speed = v, the gain in kinetic energy

$$= \tfrac12 \times 800v^2 - \tfrac12 \times 800 \times 10^2 = 400(v^2 - 100).$$

So

$$8000 = 400(v^2 - 100) \;\Rightarrow\; v^2 - 100 = 20 \;\Rightarrow\; v^2 = 120 \;\Rightarrow\; v = 10.95.$$

So final speed is 10.95 m s⁻¹.
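
The work-energy calculation of Example 1 can be repeated for other data with a short Python sketch. The function name `final_speed` and the convention of listing forces as signed components along the direction of motion are assumptions made for this illustration.

```python
import math


def final_speed(mass, initial_speed, forces_along_motion, distance):
    """Final speed from 'work done = increase of kinetic energy'.

    `forces_along_motion` holds the constant forces resolved along the
    direction of motion (resistances are negative). Their total work over
    `distance` is equated to (1/2) m v1**2 - (1/2) m v0**2.
    """
    total_work = sum(forces_along_motion) * distance
    return math.sqrt(initial_speed ** 2 + 2 * total_work / mass)


# Example 1: 800 kg car, 10 m/s, tractive force 500 N, resistance 100 N, 20 m.
v = final_speed(800, 10, [500, -100], 20)
print(round(v, 2))  # 10.95
```

The square root fails if the total work is negative and larger in magnitude than the initial kinetic energy, which corresponds to the body stopping before it has covered the stated distance.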


Exercise 11.8a 


1 A man pushes a chest of mass 20 kg through a distance of 6 m across
a horizontal floor by means of a horizontal force of 50 N.
Find:
(i) the work done by this force;
(ii) the final velocity of the chest if the floor is smooth;
(iii) the final velocity if the coefficient of friction is 0.1.


2 Ignoring all resistances, what mean propulsive force is required to
accelerate an aircraft of mass 10 tonnes from rest to a take-off speed of
40 m s⁻¹ on a runway of length 1 km? What force would be required if
the take-off speed had to be doubled in the same length of runway?


3 A bullet of mass 20 g is travelling at a speed of 400 m s⁻¹. If it
penetrates a block of wood to a depth of 5 cm, find the average resistance.


4 A stone of mass 2 kg is skidding across a sheet of ice at a speed of 10 m s⁻¹.
If it comes to rest in 80 m, find:
(i) the work done by friction;
(ii) the coefficient of friction.


5 A lorry of mass 8 tonnes needs to brake from a speed of 10 m s⁻¹ in a
distance of 80 m.

Find:
(i) the work done by the braking force;
(ii) the average braking force;
(iii) the average braking force required if this distance were to be halved;
(iv) the average braking force required if the initial speed were to be
doubled.


6 Express a vehicle's braking distance d in terms of its speed v and maximum
braking force F. Supposing that F is constant, sketch a graph showing
d as a function of v.


7 A mass of 2 kg moving at 3 m s⁻¹ catches up with a mass of 3 kg
moving at 2 m s⁻¹ in the same direction. Supposing that the bodies are
perfectly inelastic (e = 0), find:

(i) their velocities after impact;

(ii) their total loss of kinetic energy.


8 Repeat no. 7 if the bodies are elastic, with e = 7. 


9 Repeat no. 7 if the bodies are perfectly elastic, with e = 1.


10 A nail of mass 10 g is driven 1 cm into a wall by a hammer of mass
100 g which strikes it with a speed of 2 m s⁻¹. Calculate the average
resistance of the wall on the assumption:

(i) that there is no resilience between hammer and nail (e = 0);

(ii) that there is perfect resilience (e = 1).


Variable forces 


When F is a variable force, the total work done in a displacement s will
be the limit of the sum of small amounts of work $F \times \delta s$ and is represented
by the area under the F, s graph.

Elastic forces: Hooke's law


One of the commonest instances of a variable force is provided by a wire
spring or an elastic string. It was first shown by Robert Hooke (1635-1703)
that if a spring is extended from its natural length but remains within its
elastic limits, the tension T is proportional to the extension x:

$$T = kx,$$

where k is a constant of the particular spring, known as its stiffness.†
(If, in the case of a spring, x is negative, then so is T: the spring is under
compression and is exerting a negative tension, or thrust.)
When such a spring is extended, we see that T and x are in opposite
directions, so the work done by the tension T = −kx during this extension is

$$\int_0^x -kx\,dx = -\left[\tfrac12 kx^2\right]_0^x = -\tfrac12 kx^2,$$

whose magnitude can also be found from the force, distance graph.

[Figure: force, distance graph of T = −kx]


† If the natural length of the spring (or string) is l, then k is sometimes written as λ/l, where λ
is called its modulus of elasticity. Hence T = λx/l, and the corresponding elastic energy is
$\lambda x^2/2l$.
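
The idea that the work done by a variable force is the limit of a sum of small amounts of work can be illustrated numerically for the spring. The sketch below is an illustration only: the function name `work_by_tension` and the choice of a midpoint sum with 1 000 strips are assumptions. For the linear force T = −kx the sum reproduces $-\tfrac12 kx^2$.

```python
def work_by_tension(stiffness, extension, strips=1000):
    """Approximate the work done by the tension T = -k x during an
    extension from 0 to `extension`, as a sum of small amounts of work
    T * dx taken at the midpoint of each strip.
    """
    dx = extension / strips
    total = 0.0
    for i in range(strips):
        x_mid = (i + 0.5) * dx
        total += -stiffness * x_mid * dx
    return total


# Spring of stiffness 10 N/m extended by 2 m: exact value is -(1/2)(10)(2**2) = -20 J.
print(work_by_tension(10, 2))
```

The same summation works for a force that is not proportional to the extension, provided the line computing the force at `x_mid` is replaced accordingly.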

Potential energy


Finally, we note that if our loaded elastic spring with extension x were to
return to its natural length, the tension would clearly do a positive amount
of work $\tfrac12 kx^2$. This, therefore, is called the elastic potential energy of the spring,
being its stored capacity for doing work.

Similarly, if a mass m is at a height h above a given base-level, its weight
mg would do an amount of work mgh if it were to return to its original
level. This, therefore, is called its gravitational potential energy (referred to the
given base-level) when at height h.

We therefore see that, for any displacement, the loss in potential energy 
(p.e.) is equal to the work done by the weight and so is equal to the gain in 
kinetic energy (k.e.): 


loss in p.e. = gain in k.e. 


Example 2 


An elastic spring of unstretched length 1 m hangs from the ceiling of a 
room whose height is 3m. When a 1 kg mass is attached, the spring 
extends to a length 2 m, and it is then pulled down further, so that the mass 
touches the floor. If it is then released, find its velocity at heights 1 m, 
2 m above the floor, and calculate the kinetic energy, gravitational poten- 
tial energy, and elastic potential energy in all three positions. 

Let the stiffness of the spring be k. Then, taking g = 10 m s⁻², the
weight of the mass will be 10 N and

$$k \times 1 = 10 \quad\Rightarrow\quad k = 10.$$

If we measure depth x from the unstretched position, we can now
investigate:

(i) The displacement from x = 2 to x = 1.
Work done by weight = −10 × 1 = −10 J.
Also tension in spring = 10x.
So work done by spring

$$= \int_2^1 -10x\,dx = \left[5x^2\right]_1^2 = 15 \text{ J}.$$

Furthermore, if velocity is v when x = 1, gain in k.e. $= \tfrac12 \times 1 \times v^2$.
So

$$-10 + 15 = \tfrac12 v^2 \;\Rightarrow\; v^2 = 10 \;\Rightarrow\; v = \sqrt{10} = 3.2 \text{ m s}^{-1}.$$

(ii) The displacement from x = 2 to x = 0.
Work done by weight = −10 × 2 = −20 J.
Work done by spring

$$= \int_2^0 -10x\,dx = \left[5x^2\right]_0^2 = 20 \text{ J}.$$

So if velocity is V when x = 0, gain in k.e. $= \tfrac12 \times 1 \times V^2 = \tfrac12 V^2$.
So

$$-20 + 20 = \tfrac12 V^2 \;\Rightarrow\; V = 0.$$

Finally, if we measure gravitational p.e. from floor level and elastic p.e.
from the unstretched position (so that elastic p.e. $= \tfrac12 \times kx^2 = 5x^2$), we
can express the three forms of energy in the following table:

Depth from ceiling/m    Gravitational p.e./J    Elastic p.e./J    k.e./J
1 (x = 0)               20                      0                 0
2 (x = 1)               10                      5                 5
3 (x = 2)               0                       20                0


So we note that at each level the sum of the three forms of energy is 20 J.
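
The energy table of Example 2 can be generated by a short Python sketch. This is an illustration under stated assumptions: the names `energies`, `FLOOR_X` and the other constants are chosen here, and the kinetic energy is obtained by assuming that the total energy stays at its value on release (20 J), which is the conclusion the example reaches.

```python
G = 10.0          # m s^-2, the value taken in the example
MASS = 1.0        # kg
STIFFNESS = 10.0  # N m^-1, from k * 1 = 10
FLOOR_X = 2.0     # extension (m) when the mass touches the floor


def energies(x, total):
    """Gravitational p.e., elastic p.e. and k.e. (in joules) at extension x.

    Gravitational p.e. is measured from floor level, elastic p.e. from the
    unstretched position, and k.e. is whatever remains of `total`.
    """
    gravitational = MASS * G * (FLOOR_X - x)
    elastic = 0.5 * STIFFNESS * x ** 2
    kinetic = total - gravitational - elastic
    return gravitational, elastic, kinetic


# Released from rest at the floor, so the total is the elastic p.e. there.
total = 0.5 * STIFFNESS * FLOOR_X ** 2
for x in (0.0, 1.0, 2.0):
    print(x, energies(x, total))
```

At x = 1 the kinetic energy of 5 J corresponds to $v = \sqrt{10} \approx 3.2$ m s⁻¹, in agreement with part (i).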

Hence, in both our examples, the total energy of each system remains
constant. More generally, this is true in any system, provided that the work
done by its forces to achieve any particular position is independent of the route by which
this position is achieved.

The general result is then known as the principle of conservation of energy.

It is, however, necessary to issue a warning that not all systems are as
well-behaved (or 'conservative') as these. If, for instance, we consider a mass
being moved across a rough table from a fixed starting point to a particular
position, it is clear that the work done by friction is entirely dependent on
the route taken. So friction cannot have a potential and is a non-conservative
force, so that it is impossible to have a principle of conservation of mechanical
energy in this case. We shall meet many similar instances, such as the
impact of particles which are not perfectly elastic, when the total mechanical
energy of the system is seen to diminish, though this is accompanied by
the production of other forms of energy, such as heat or sound, which
conserve the overall energy of the system.


Exercise 11.8b 


1 If a spring has stiffness 200 N m⁻¹, find:
(i) the tension needed to extend it by 4 cm;
(ii) the force required to compress it by 5 mm;
(iii) its extension when pulled out by a force of 40 N;
(iv) its compression when pushed in by a force of 10 N.


2 In each case of no. 1, find the elastic energy stored in the spring.


3 An elastic string is extended 2 mm by a tension of 4 N. Find:
(i) its stiffness;
(ii) its elastic energy.


4 A spring is compressed a distance of 1 cm by a force of 500 N. Find:
(i) its stiffness;
(ii) its elastic energy.


5 An elastic string has natural length 2 m and stiffness 50 N m⁻¹. Find:
(i) how far it stretches when a mass 20 kg hangs from it;

(ii) its elastic energy in this position;

(iii) the work done by its weight if the 20 kg was attached to the end of the
string when unstretched and gently lowered into its equilibrium position.
(Take g ≈ 10 m s⁻².)


6 The particle of mass 20 kg in no. 5 is attached to the string and dropped
vertically from its point of suspension. When the string has become taut
and is extended by a further amount x, find:
(i) the loss in p.e. of the weight;
(ii) the elastic energy of the string;
(iii) the k.e. of the particle.

Hence find the maximum extension of the string. (Take g ≈ 10 m s⁻².)


7 A nylon climbing rope of length 40 m is stretched by a force F. The
relation between the force F and the corresponding extension x is given in
the following table:

x/m    0    1       2       3       4       5       6       7       8       9       10
F/N    0    1600    2400    3000    3600    4100    4600    5200    5700    6400    7900

Use Simpson's rule to estimate the energy stored in the rope when it is
extended by 10 m.

A climber of mass 64 kg is climbing on a vertical rock-face h m above
the point to which he is securely attached by the rope of length 40 m
described above. The climber slips and falls freely, being brought to rest
for the first time by the rope when it has been extended by 10 m.

Calculate the value of h to the nearest metre.

Determine also the velocity of the climber at the moment the rope
becomes taut, and hence, or otherwise, calculate the average deceleration
of the climber from that moment until he comes to rest. (Take g = 9.8 m s⁻².)

(J.M.B.)


8 If the radius of the Earth is R, the gravitational attraction on a body of
mass 1 kg at a distance x from the Earth's centre is

$$\frac{gR^2}{x^2} \text{ if } x \ge R; \qquad \frac{gx}{R} \text{ if } x \le R.$$

Hence find the work done by this force (or weight):
(i) if the body came to the Earth's surface from a very great distance;
(ii) if the body could move from the Earth's surface in to its centre.


9 From no. 8 find:
(i) the minimum speed with which a body from outer space would
approach the Earth;
(ii) the speed with which a body would reach the centre of the Earth if it
could be dropped from its surface down a vertical shaft. (Take g ≈ 10 m s⁻²,
R = 6.4 × 10⁶ m.)


Work as a scalar product 


So far, we have restricted ourselves to the work done by a force when its 
displacement is in its own direction, but we are now able to examine the 
more general case. 

If a force $\mathbf{F}$ undergoes a displacement $\mathbf{s}$, the work done is defined as the
scalar product $\mathbf{F}\cdot\mathbf{s}$. We therefore see that work is a scalar of magnitude
$Fs\cos\theta$ (where θ is the angle between $\mathbf{F}$ and $\mathbf{s}$) and so can be regarded as

    F × component of s along F

or  s × component of F along s.


In particular:
(i) when θ = 0, work = +Fs;

(ii) when θ = π, work = −Fs;

(iii) when θ = ½π, work = 0.

[Figures: F and s in the same direction; F and s in opposite directions; F perpendicular to s]

Hence the work done in any displacement perpendicular to the force is
always zero. In particular, the work done by a normal reaction at a smooth
surface is always zero.

Moreover, we have already seen (in 9.5) that the operation of scalar
multiplication is distributive with respect to addition.

Hence $\mathbf{F}_1\cdot\mathbf{s} + \mathbf{F}_2\cdot\mathbf{s} = (\mathbf{F}_1 + \mathbf{F}_2)\cdot\mathbf{s}$ and the sum of the amounts of
work done by two forces is equal to the work done by their resultant.

Similarly, $\mathbf{F}\cdot\mathbf{s}_1 + \mathbf{F}\cdot\mathbf{s}_2 = \mathbf{F}\cdot(\mathbf{s}_1 + \mathbf{s}_2)$ and the sum of the amounts
of work done by a force in two displacements is equal to the work done by
the force in the resultant displacement.

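Furthermore, if $\mathbf{F}$ is a variable force, the work done in a small displacement
$\delta\mathbf{s}$ will be $\delta W$, where

$$\delta W = \mathbf{F}\cdot\delta\mathbf{s}.$$

Hence the total work can be written as

$$W = \lim \sum \mathbf{F}\cdot\delta\mathbf{s} = \int \mathbf{F}\cdot d\mathbf{s}.$$

Alternatively, if the velocity and acceleration of the particle are $\mathbf{v}$, $\mathbf{a}$
respectively, we can write

$$\frac{\delta W}{\delta t} = \mathbf{F}\cdot\frac{\delta\mathbf{s}}{\delta t} \quad\Rightarrow\quad \frac{dW}{dt} = \mathbf{F}\cdot\mathbf{v}.$$

But, by Newton's second law, $\mathbf{F} = m\mathbf{a} = m\,d\mathbf{v}/dt$, so that

$$\frac{dW}{dt} = m\frac{d\mathbf{v}}{dt}\cdot\mathbf{v} = \frac{d}{dt}\left(\tfrac12 m\mathbf{v}\cdot\mathbf{v}\right) = \frac{d}{dt}\left(\tfrac12 mv^2\right),$$

and again we see that the work done by a force is equal to the increase in
kinetic energy.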




Example 3 


A particle of mass 6 kg slides from rest down a rough plane which is inclined
at 30° to the horizontal. If the resistance to motion is 12 N throughout, find
the work done by the various forces when the mass has moved a distance
8 m and hence find its velocity at this point. (Take g ≈ 10 m s⁻².)

[Figure: particle on the inclined plane with normal reaction R, resistance 12 N and weight 60 N]

The three forces on the particle are:
(i) its weight 6 × 10 = 60 N;
(ii) the normal reaction R;
(iii) the resistance 12 N;
and the amounts of work done by these in the displacement 8 m are:
(i) 60 × 8 × cos 60° = 240 J;
(ii) zero (since force is perpendicular to displacement);
(iii) −12 × 8 = −96 J.
Hence total work = 144 J.
But final k.e. $= \tfrac12 \times 6 \times v^2 = 3v^2$.
So

$$144 = 3v^2 \;\Rightarrow\; v^2 = 48 \;\Rightarrow\; v = 6.9 \text{ m s}^{-1}.$$
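
The three amounts of work in Example 3 can also be found directly as scalar products, by writing each force and the displacement in components. The Python sketch below is illustrative: the function names `dot` and `incline_works`, and the choice of axes (x horizontal, y vertically upwards), are assumptions made here.

```python
import math


def dot(a, b):
    """Scalar product of two vectors given as tuples of components."""
    return sum(x * y for x, y in zip(a, b))


def incline_works(mass, angle_deg, resistance, distance, g=10.0):
    """Work done by weight, normal reaction and resistance (in that order)
    when a particle slides `distance` down a plane inclined at `angle_deg`.
    """
    theta = math.radians(angle_deg)
    down_slope = (math.cos(theta), -math.sin(theta))  # unit vector down the plane
    normal = (math.sin(theta), math.cos(theta))       # unit vector out of the plane
    s = tuple(distance * c for c in down_slope)
    weight = (0.0, -mass * g)
    reaction = tuple(mass * g * math.cos(theta) * c for c in normal)
    friction = tuple(-resistance * c for c in down_slope)
    return dot(weight, s), dot(reaction, s), dot(friction, s)


works = incline_works(6, 30, 12, 8)
v = math.sqrt(2 * sum(works) / 6)  # total work = (1/2) m v**2, starting from rest
print([round(w, 6) for w in works], round(v, 1))
```

The three values come out close to 240 J, 0 J and −96 J, and the speed close to 6.9 m s⁻¹, matching the working above. Because the scalar product is distributive, summing the three amounts of work is equivalent to taking the scalar product of the resultant force with the displacement.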


Exercise 11.8c 


1 Find the work done (in SI) when a force t + 37 + Kk acts on a particle 
and displaces it by an amount: 

=e 2p ΤΆΞΑΣ (11) 34 — 2) — k; (11) ἃ -- Κ. 

2 A car of mass 800 kg is travelling at 30 m s⁻¹ with its engine switched
off when it comes to a small hill. Find:

(i) its k.e. at the foot of the hill;

(ii) the maximum height of hill which the car can surmount (ignoring all
resistances).


3 A boy slides down a smooth chute at a swimming pool. If its top is 6 m
above the water and its bottom is 1 m above the water, find his speed:

(i) on leaving the chute;

(ii) on entering the water.


4 A box of mass 10 kg is attached to the end of a rope of length 5 m
and hangs freely. If it is then given a horizontal impulse of 100 N s find:
(i) its initial speed;

(ii) its initial kinetic energy;

(iii) the height to which it will rise.


5 A marble of mass 20 g rests at the top of a smooth circular tube which
has diameter 1 m and is in a vertical plane. If the marble is just displaced
from rest, find its speed:

(i) when travelling vertically at the end of the horizontal diameter;

(ii) when passing through the lowest point of the tube;

(iii) when 25 cm above the lowest point.

Hence find the reactions of the tube on the marble at these three instants.


6 A boy of mass 40 kg swings on a rope of length 2 m which is just taut
and attached to a point at the same height. When the rope is vertical, find:
(i) the kinetic energy of the boy;

(ii) the speed of the boy;

(iii) the tension in the rope.


7 A commando swings on a rope fixed at its upper end A. The rope,
initially inclined at 60° to the downward vertical, remains taut and may
be considered as inextensible and of negligible mass. The commando can
be idealised as a particle at the end of the rope. If:
(i) his mass is 70 kg;
(ii) the rope is 10 m long;
(iii) he lets go when the rope is vertical;
(iv) A is 12 m above the ground;
find his landing speed.
(Neglect air resistance, and take g = 10 m s⁻².)

Which of (i), (ii), (iii), (iv) can be varied, one at a time, without affecting
this speed? (S.M.P.)


8 A particle of mass m is slightly displaced from the top of a smooth
hemispherical dome of radius a. If its speed is v when the radius to the
particle makes an angle θ with the vertical, find:

(i) an equation connecting v with a, g and θ;

(ii) the normal reaction N in terms of a, g and θ;

(iii) the value of θ for which N = 0 (i.e., at which the particle leaves the
surface).


Power 


We know from everyday experience that when a certain task can be 
accomplished by either of two forces, it frequently happens that one of 
them can do the necessary work more quickly. Such a force is then said to 
have ‘greater power’. 

More precisely, we define the power P of a force as its rate of doing work:

$$P = \frac{dW}{dt}.$$

But we have already seen that when a force $\mathbf{F}$ is moving its point of
application with velocity $\mathbf{v}$,

$$\frac{dW}{dt} = \mathbf{F}\cdot\mathbf{v}.$$

Hence

$$P = \frac{dW}{dt} = \mathbf{F}\cdot\mathbf{v}.$$

The unit of power in SI is therefore 1 joule per second, which is called 1 watt (W):

$$1 \text{ W} = 1 \text{ J s}^{-1}.$$


Example 4 


In British units the standard unit of power was 1 horse power (h.p.), being
the rate of working required to lift a mass of 550 pounds through 1 foot in
1 second.

Taking 1 lb = 0.454 kg, 1 ft = 0.305 m, g = 9.81 m s⁻², find 1 h.p. in
kilowatts.


In 1 s, a force working at a rate of 1 h.p. will lift 550 lb a height of 1 foot,
i.e., 550 × 0.454 kg a height of 0.305 m.


So work done in 1 s
= (550 × 0.454 × 9.81) × 0.305 J
≈ 746 J.

Hence 1 h.p. ≈ 746 J s⁻¹ = 746 W
⇒ 1 h.p. ≈ 0.746 kW.

Example 5 


Find the power of a pump which raises water from a lake at a rate of
5 m³ s⁻¹ and delivers it 6 m higher at a speed of 8 m s⁻¹.


In 1 s, the mass of water moved = 5 × 10³ kg.

So weight of water moved = 5 × 10³ × 9.81 N and the work done in raising
it through 6 m

$$= 5 \times 10^3 \times 9.81 \times 6 = 2.94 \times 10^5 \text{ J}.$$

But this water is also given an amount of kinetic energy

$$= \tfrac12 \times (5 \times 10^3) \times 8^2 = 1.6 \times 10^5 \text{ J}.$$

So total work done in 1 s = 4.54 × 10⁵ J and the power of the pump

= 4.54 × 10⁵ W

= 454 kW.
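
The pump calculation separates into the rate of gain of gravitational potential energy and the rate of gain of kinetic energy. A minimal Python sketch of this, with the hypothetical function name `pump_power` and the density of water taken as 1 000 kg m⁻³ (the value implied by 5 m³ having mass 5 × 10³ kg), is:

```python
def pump_power(volume_rate, lift_height, exit_speed, density=1000.0, g=9.81):
    """Power (in watts) needed to raise water and deliver it at a given speed.

    volume_rate is in m^3 per second. Each second a mass
    density * volume_rate gains potential energy m g h and kinetic
    energy (1/2) m v**2.
    """
    mass_rate = density * volume_rate
    lifting = mass_rate * g * lift_height
    kinetic = 0.5 * mass_rate * exit_speed ** 2
    return lifting + kinetic


# Example 5: 5 m^3/s raised 6 m and delivered at 8 m/s.
print(pump_power(5, 6, 8) / 1000, "kW")
```

For the data of Example 5 this gives about 454 kW. An efficiency factor, as required in some of the exercises, could be allowed for by dividing the result by the efficiency expressed as a fraction.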

Exercise 11.8d


1 A pump delivers water at the rate of 900 kg min⁻¹ from a reservoir
to a nozzle 8 m above the surface of the reservoir. The water emerges from
the nozzle with a speed of 12 m s⁻¹. Calculate the power, in kilowatts, of
the pump. (C.)


2 A pump raises 1 000 kg of water each minute from a depth of 20 m
and delivers it at a speed of 8 m s⁻¹. Assuming that the efficiency of the
pump is 40%, calculate its power. (O.C.)


3 A railway train has mass 1.4 x 10° kg and the maximum power of 
the engine is 1.8 x 10° W. Calculate the maximum acceleration, in 
τ 5. *, when the train is travelling on the level at a speed of 12 πὶ 5. ᾿, the 
total resistance being 9.6 x 1074 N. (c.) 


4 Acar of power 50 kW has mass 800 kg, which is equally distributed 
over the four wheels. The coefficient of friction between the two driving 
wheels and the road is 0.3. If the road is level, and resistances to motion 
can be neglected, find the maximum acceleration of the car (in πὶ 5. 7): 
(1) at20kmh™'; 

(1) at l1Okmh™?. (ο.α.) 


5 An escalator takes passengers from a lower level to a higher level
through a vertical height of 20 m. On average two passengers get on the
escalator per second and two leave at the top per second. If the average
mass of a passenger is taken to be 75 kg and the machinery has 60% efficiency,
calculate, in watts, the average power output of the motor which
drives the escalator. (Take g to be 9.8 m s⁻².) (C.)


6 A car of mass 1 000 kg is moving on a level road at a steady speed of
100 km h⁻¹ with its engine working at 60 kW. Calculate in newtons the
resistance to motion, which may be assumed to be constant.

The engine is now disconnected, the brakes applied, and the car comes
to rest in 100 m. Find the additional retarding force arising from the
application of the brakes.

If the engine is still disconnected, find the distance the car would run up 
a hill of inclination sin ! τς before coming to rest, starting at 100 kmh“, 
when the same resistance and braking force are operating. (c.) 


7 Sand pours over the lip of a chute and falls vertically onto a conveyor
belt which is moving horizontally at a steady speed of 0.5 m s⁻¹. If the
supply of sand is steady at the rate of 50 kg s⁻¹, what horizontal momentum
is given to the sand each second? Find the force that is required to
give the sand this momentum.

If 10 W is the power required to operate the conveyor belt without a
load, find whether a motor of maximum power 25 W can deal with sand
at the rate of supply of 50 kg s⁻¹.

If the chute were modified so that the sand was delivered with a horizontal
component of velocity of 0.5 m s⁻¹ in the direction of motion of
the belt, find the maximum speed at which the belt could move the load,
assuming that the power required for the belt itself is unaltered and the
sand is delivered at the same rate as before. (M.E.I.)


8 A VTOL aircraft hovers by taking in air through a large intake and 
ejecting it at high velocity downwards. If the relative velocity of ejection 
is ums‘ and the mass of the aircraft is M kg, find expressions for: 

(7) the mass of air ejected per second for hovering ; 

(11) the power required to hover. 

Later, when the aircraft is flying as a normal aeroplane, the engines work 
at half this power and the air resistance varies as the square of the speed. 
The maximum steady speed attainable in level flights is Vms~ *. Show that 
at a speed of } Vms_ ‘ the engines would supply enough power for the air- 


craft to climb in normal flight at a rate of 370/256 τὴ 5. ΄. (M.E.L.) 


9 A car of mass m is started along a horizontal road by an engine which
works at a constant rate P. Ignoring all resistances to motion and letting
the distance travelled, velocity and acceleration after time t be x, v, a
respectively, find equations which connect:


(i) a, v; (ii) v, t; (iii) v, x; (iv) x, t.


10 An engine is fitted to a vehicle of mass m kg. It is known from tests
that it will give constant tractive force F N up to a road-speed of U m s⁻¹
and at higher speeds will give constant power FU watts. Show that at speed
v m s⁻¹ (v > U) the tractive force is FU/v N. Form the appropriate
equations of motion for the vehicle if the resistance to motion at all speeds
is given by kv² N, k being constant.

Show that there is a maximum attainable speed V m s⁻¹ and find an
expression for V in terms of k, U and F. Given that m = 500, F = 2 500,
U = 25 and V = 50, determine the value of k. Calculate the acceleration
at each of the speeds 5, 15, 25, 35 m s⁻¹ and hence estimate the time taken
to reach 40 m s⁻¹ from rest, giving your result to the nearest second.
Explain your method. (M.E.I.)


11.9 Oscillations 


In 5.10 we spent a little time studying simple oscillations like alternating 
currents, together with their amplitudes, periods and frequencies. We can 
now take this investigation further, and shall begin by considering three 
other ways in which such oscillations can arise. 


Example 1 


If a spring of stiffness k hangs vertically from a fixed point, with its other 
end attached to a mass m, investigate its equation of motion at a point when 
it is pulled down a further distance x.

[Figure: the spring unloaded; loaded and in equilibrium (extension = a);
loaded and in motion (extension = a + x), with the weight mg acting on the
mass.]


By Hooke’s law, when extension = a, the tension = ka. So, since the 
mass is then in equilibrium, ka = mg. 
Also, when extension = a + x, tension = k(a + x). 


So the general equation of motion is


   $mg - k(a + x) = m\ddot{x}$.
But ka = mg.
So $-kx = m\ddot{x}$
=> $\ddot{x} = -\dfrac{k}{m}x$.


Hence the acceleration of the mass is proportional to its extra displace- 
ment and, of course, in the opposite direction: the tension in the spring 
acts as a restoring force, attempting to re-establish equilibrium. 


Example 2 


Investigate the equation of motion of a cylindrical canister buoy, bobbing 
up and down at sea. 

We shall suppose that the buoy has constant cross-sectional area A and
height h, and that in equilibrium it is submerged to a depth d. If the density
of the buoy is ρ and of the sea is σ, we can investigate its motion when it is
submerged by a further amount x.

In each case, both of equilibrium and of submersion by a further amount x, 
there are just two forces acting: 


(i) the weight of the buoy.
Now its volume = Ah
so its mass = Ahρ
and its weight = Ahρg;

[Figure: the buoy in equilibrium, acted on by upthrust Adσg and weight Ahρg;
and after further submersion x, acted on by upthrust A(d + x)σg and weight
Ahρg.]

(ii) the buoyancy, or upthrust of water on the buoy, which Archimedes 
showed to be equal to the weight of the water displaced. 
Now in the equilibrium position:


   volume of water displaced = Ad
=> mass of water displaced = Adσ
=> weight of water displaced = Adσg
=> buoyancy = Adσg;


and after submersion by a further amount x, the buoyancy is A(d + x)σg.
At the equilibrium position, we see that


   Adσg = Ahρg
=> dσ = hρ;


and at the more general position, using Newton's second law,


   $Ah\rho g - A(d + x)\sigma g = Ah\rho\ddot{x}$
=> $-Ax\sigma g = Ah\rho\ddot{x}$
=> $\ddot{x} = -\dfrac{\sigma g}{\rho h}x$.


So, again, we see that the acceleration of the buoy is proportional to its 
displacement from the equilibrium position and in the opposite direction. 

As our final example, we now investigate an oscillation which, unlike 
the previous cases, is not along a straight line.


Example 3 (Simple pendulum)


A mass m is suspended from a fixed point by means of a light inextensible
string of length l and swings to and fro in a vertical plane.


When the string makes an angle θ with the vertical, the acceleration of
the mass has two components:


    $l\ddot{\theta}$ perpendicular to the string,
and $l\dot{\theta}^2$ along the string.


Furthermore, there are just two forces acting:


(i) the weight mg,
and (ii) the tension T in the string.


Hence, by Newton's second law and resolving perpendicular to the string:
   $-mg\sin\theta = ml\ddot{\theta}$
=> $\ddot{\theta} = -\dfrac{g}{l}\sin\theta$.


In this case, therefore, $\ddot{\theta}$ is proportional not to θ, but to sin θ. However,
if the string is swinging through a small angle, we know that sin θ ≈ θ.


Hence $\ddot{\theta} \approx -\dfrac{g}{l}\theta$.
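As a rough numerical check on the approximation sin θ ≈ θ used above, the following Python sketch compares the two for a few angles. The helper name small_angle_error and the sample angles are illustrative choices, not taken from the text.

```python
import math

def small_angle_error(theta):
    """Relative error made by replacing sin(theta) with theta (radians)."""
    return (theta - math.sin(theta)) / math.sin(theta)

# Sample angles chosen purely for illustration.
for degrees in (1, 5, 10, 20):
    theta = math.radians(degrees)
    print(f"{degrees:>2} deg: sin = {math.sin(theta):.5f}, "
          f"theta = {theta:.5f}, error = {small_angle_error(theta):.3%}")
```

For swings of a few degrees the error is a small fraction of one per cent, which is why the pendulum can then be treated as approximately simple harmonic.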


Simple harmonic motion 
In all three examples the final equation of motion was of the form 
   $\ddot{x} = -kx$ (k > 0),


which is evidently of considerable importance. Any motion which can be
modelled by such an equation is called simple harmonic motion (or s.h.m.).

As k > 0, we can let k = ω² (so that ω is another constant) and obtain
the equation:

   $\ddot{x} = -\omega^2 x$.

This proves to be a more convenient form of the equation and is therefore
adopted as standard.
We can now proceed to find its solutions by either of two methods: 


Method 1 
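   $\ddot{x} = -\omega^2 x$
=> $2\dot{x}\ddot{x} = -2\omega^2 x\dot{x}$.


If we now integrate with respect to t, it follows that
   $\dot{x}^2 = -\omega^2 x^2 + A$.
Supposing that when $\dot{x} = 0$ the value of x is a
(i.e., that a is the amplitude of the oscillation), it follows that
   $0 = -\omega^2 a^2 + A$
=> $A = \omega^2 a^2$.


So $\dot{x}^2 = \omega^2(a^2 - x^2)$.
Hence $\dfrac{dx}{dt} = \pm\omega\sqrt{a^2 - x^2}$
=> $\pm\displaystyle\int \frac{dx}{\sqrt{a^2 - x^2}} = \int \omega\,dt$
=> $\pm\sin^{-1}\dfrac{x}{a} = \omega t + \varepsilon$ (where ε is a constant angle)
=> $\dfrac{x}{a} = \pm\sin(\omega t + \varepsilon)$
=> $x = \pm a\sin(\omega t + \varepsilon)$.


Now $-\sin(\omega t + \varepsilon) = \sin(\omega t + \varepsilon + \pi)$,
so $x = a\sin(\omega t + \varepsilon)$ or $a\sin(\omega t + \varepsilon + \pi)$.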


But ε is itself an arbitrary constant, so the second of these general solutions
is the same as the first.

Hence x = a sin(ωt + ε) and any simple harmonic motion can be
described by an ordinary sine oscillation. The constant ε is called the angle
of phase, or simply the phase.

Moreover, sin(ωt + ε) takes the same value when ωt is increased by
2π, i.e., when t is increased by 2π/ω.


So the period of the s.h.m. is 2π/ω (and its frequency is ω/2π).

Summarising, we see that the s.h.m. has the properties:


   $\ddot{x} = -\omega^2 x$
   $\dot{x}^2 = \omega^2(a^2 - x^2)$
   $x = a\sin(\omega t + \varepsilon)$


   amplitude = a; period = $\dfrac{2\pi}{\omega}$; phase angle = ε
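The properties summarised above can be checked numerically. The sketch below integrates the equation of motion step by step and compares the result with the closed form x = a sin(ωt + ε); the function name simulate_shm, the velocity-Verlet stepping scheme and the constants are all assumptions made for illustration, not part of the text.

```python
import math

def simulate_shm(omega, x0, v0, t_end, steps=20000):
    """Integrate x'' = -omega**2 * x from t = 0 to t_end (velocity-Verlet)."""
    dt = t_end / steps
    x, v = x0, v0
    acc = -omega**2 * x
    for _ in range(steps):
        x += v * dt + 0.5 * acc * dt**2
        new_acc = -omega**2 * x
        v += 0.5 * (acc + new_acc) * dt
        acc = new_acc
    return x, v

# Illustrative constants, not taken from the text.
omega, a, eps = 4.0, 5.0, 0.3
x0 = a * math.sin(eps)             # x at t = 0
v0 = a * omega * math.cos(eps)     # dx/dt at t = 0

t = 1.0
x_num, v_num = simulate_shm(omega, x0, v0, t)
print(x_num, a * math.sin(omega * t + eps))      # numerical vs closed form
print(v_num**2, omega**2 * (a**2 - x_num**2))    # both sides of the speed relation

period = 2 * math.pi / omega
x_after_period, _ = simulate_shm(omega, x0, v0, period)
print(x_after_period, x0)                        # back at the start after one period
```

Each printed pair should agree to several decimal places, the small differences being due only to the finite step length.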


Method 2 

Alternatively, $\ddot{x} = -\omega^2 x$ can be written
   $\ddot{x} + \omega^2 x = 0$
⇔ $\dfrac{d^2x}{dt^2} + \omega^2 x = 0$.


Now $x = Ae^{mt}$ is a solution, provided that
   $m^2 + \omega^2 = 0$ => $m = \pm i\omega$.
Hence the complementary function is


   $x = Ae^{i\omega t} + Be^{-i\omega t}$
   $= A(\cos\omega t + i\sin\omega t) + B(\cos\omega t - i\sin\omega t)$
   $= C\sin\omega t + D\cos\omega t$,


where C, D (like A, B) are arbitrary constants.


If we now choose a, ε so that


   a cos ε = C
   a sin ε = D,
it follows that


   x = a(sin ωt cos ε + cos ωt sin ε)
=> x = a sin(ωt + ε).


Hence $\dot{x} = a\omega\cos(\omega t + \varepsilon)$


and it follows immediately that
   $\dot{x}^2 = \omega^2(a^2 - x^2)$.


In particular, we can now calculate the periods of the oscillations in the 
above examples: 


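Spring of stiffness k supporting particle of mass m:


   $\omega = \sqrt{\dfrac{k}{m}}$ => period = $2\pi\sqrt{\dfrac{m}{k}}$


Buoy of density ρ in sea of density σ:


   $\omega = \sqrt{\dfrac{\sigma g}{\rho h}}$ => period = $2\pi\sqrt{\dfrac{\rho h}{\sigma g}}$


Simple pendulum of length l:


   $\omega = \sqrt{\dfrac{g}{l}}$ => period = $2\pi\sqrt{\dfrac{l}{g}}$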
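The three period formulae above translate directly into code. In this minimal sketch the function names, the default g = 9.81 m s⁻² and the sample values are assumptions made for illustration.

```python
import math

def spring_period(m, k):
    """Mass m on a spring of stiffness k: period 2*pi*sqrt(m/k)."""
    return 2 * math.pi * math.sqrt(m / k)

def buoy_period(rho, h, sigma, g=9.81):
    """Cylindrical buoy of density rho and height h in liquid of density sigma."""
    return 2 * math.pi * math.sqrt(rho * h / (sigma * g))

def pendulum_period(l, g=9.81):
    """Simple pendulum of length l, for small swings only."""
    return 2 * math.pi * math.sqrt(l / g)

# Illustrative values (SI units).
print(spring_period(m=5, k=20))                 # pi seconds
print(buoy_period(rho=500, h=2, sigma=1025))
print(pendulum_period(l=1.0))
```

Note that the spring period does not involve g, whereas the buoy and the pendulum both depend on it.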


Example 4

A particle is describing simple harmonic motion in a straight line. When
its distance from O, the centre of its path, is 3 m, its velocity is 16 m s⁻¹
towards O and its acceleration is 48 m s⁻² towards O. Find:


(i) the period of the motion;
(ii) the amplitude of the motion;
(iii) the time taken by the particle to reach O;
(iv) the velocity of the particle as it passes through O.


(i) With the usual notation, $\ddot{x} = -\omega^2 x$.
But when x = 3, $\ddot{x} = -48$.


So $-48 = -3\omega^2$ => ω = 4.


Hence period = $\dfrac{2\pi}{\omega} = \dfrac{\pi}{2} \approx 1.57$ s.


(ii) Furthermore, $\dot{x}^2 = \omega^2(a^2 - x^2)$.
But when x = 3, $\dot{x} = -16$.

So 256 = 16(a² − 9)
=> a² − 9 = 16 => a = 5.


So amplitude = 5 m.


(iii) Moreover, measuring t from the instant when x = 0, we know that


   x = a sin ωt = 5 sin 4t.


So letting x = 3, we see that


   3 = 5 sin 4t => 4t = sin⁻¹ 0.6 = 0.6435 (≈ 36° 52′)
=> t = 0.16 s.
So time taken = 0.16 s.

(iv) Finally, $\dot{x}^2 = 16(25 - x^2)$


and putting x = 0
we obtain $\dot{x}^2 = 16 \times 25$
=> $|\dot{x}| = 4 \times 5 = 20$.


So velocity of particle as it passes through O is 20 m s⁻¹.
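The arithmetic of Example 4 can be reproduced in a few lines. The variable names below are illustrative; displacement, velocity and acceleration are measured away from O, so the given velocity and acceleration are entered as negative.

```python
import math

x, v, acc = 3.0, -16.0, -48.0      # data of Example 4, measured away from O

omega = math.sqrt(-acc / x)                      # from acc = -omega^2 x
period = 2 * math.pi / omega
amplitude = math.sqrt(x**2 + (v / omega)**2)     # from v^2 = omega^2 (a^2 - x^2)
time_to_O = math.asin(x / amplitude) / omega     # from x = a sin(omega t)
speed_at_O = omega * amplitude                   # v^2 = omega^2 a^2 at x = 0

print(omega, period, amplitude, time_to_O, speed_at_O)
```

The printed values agree with the results found above: ω = 4, period about 1.57 s, amplitude 5 m, time about 0.16 s and speed 20 m s⁻¹.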


Example 5 


A particle of mass m hangs at rest from the end of a vertical string which is
attached to a fixed point and stretched by an amount e. If it is then given a
further extension a and then released, find the period of its subsequent
oscillations if (i) a < e; (ii) a > e.

[Figure: the string at its natural length, and the particle in its equilibrium
position, starting position and general position.]


Let the stiffness of the string be k. Then, by considering the position of 
equilibrium, 


   mg = ke => $k = \dfrac{mg}{e}$.


At the general position, applying Newton's second law:
   $mg - k(e + x) = m\ddot{x}$
=> $-kx = m\ddot{x}$
=> $\ddot{x} = -\dfrac{k}{m}x = -\dfrac{g}{e}x$.
m 6 


So the particle executes simple harmonic motion with w? = g/e, provided 
only that the string remains taut. It is therefore now necessary to dis- 
tinguish two cases: 


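(i) a < e.


In this case, the string remains taut, so the period of the oscillation is


   $\dfrac{2\pi}{\omega} = \dfrac{2\pi}{\sqrt{g/e}} = 2\pi\sqrt{\dfrac{e}{g}}$.

(ii) a > e.


In this case the string becomes slack, as when x = −e the particle is still
travelling upwards. So the full cycle consists of three parts:


(a) Particle below equilibrium position (x > 0):


   time taken = $\dfrac{\pi}{\omega} = \pi\sqrt{\dfrac{e}{g}}$.


(b) Particle above equilibrium position but string still taut (−e < x < 0):


as the motion is simple harmonic, and if we measure time from the instant
of passing through the equilibrium position,


   $x = -a\sin\omega t = -a\sin\left(\sqrt{\dfrac{g}{e}}\,t\right)$.


Putting x = −e,


   $-e = -a\sin\left(\sqrt{\dfrac{g}{e}}\,t\right)$ => $t = \sqrt{\dfrac{e}{g}}\sin^{-1}\dfrac{e}{a}$.

This stage occurs twice in each cycle, once on the way up and once on the
way down.
So time taken = $2\sqrt{\dfrac{e}{g}}\sin^{-1}\dfrac{e}{a}$.


(c) String slack:
The string becomes slack when x = −e.
But $\dot{x}^2 = \omega^2(a^2 - x^2)$.


So at this point $\dot{x}^2 = \dfrac{g}{e}(a^2 - e^2)$.


While the string is slack the particle moves freely under gravity, so it
returns to this point after a time $2|\dot{x}|/g$.
Hence time taken with string slack


   $= \dfrac{2|\dot{x}|}{g} = 2\sqrt{\dfrac{a^2 - e^2}{ge}}$.


So total time taken


   $= \pi\sqrt{\dfrac{e}{g}} + 2\sqrt{\dfrac{e}{g}}\sin^{-1}\dfrac{e}{a} + 2\sqrt{\dfrac{a^2 - e^2}{ge}}$
   $= 2\sqrt{\dfrac{e}{g}}\left\{\dfrac{\pi}{2} + \sin^{-1}\dfrac{e}{a} + \sqrt{\dfrac{a^2}{e^2} - 1}\right\}$.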
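The two cases of Example 5 can be combined in one function. This is a sketch under the assumptions of the example (light string, no air resistance); the name string_period, the default value of g and the sample figures are illustrative.

```python
import math

def string_period(a, e, g=9.81):
    """Period of a particle on an elastic string stretched by e in equilibrium,
    released from rest a distance a below the equilibrium position."""
    root = math.sqrt(e / g)
    if a <= e:
        return 2 * math.pi * root                       # string stays taut: pure s.h.m.
    taut_below = math.pi * root                         # (a) half a cycle below equilibrium
    taut_above = 2 * root * math.asin(e / a)            # (b) up and down with string taut
    slack = 2 * math.sqrt((a**2 - e**2) / (g * e))      # (c) free flight under gravity
    return taut_below + taut_above + slack

# Illustrative values in metres.
print(string_period(a=0.1, e=0.2))    # taut throughout
print(string_period(a=0.3, e=0.2))    # string goes slack for part of the cycle
```

As a approaches e from above, stage (c) shrinks to nothing and the total tends to 2π√(e/g), so the two cases join up continuously.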


Exercise 11.9 


1 A spring has natural length 4 m and stiffness 20 N m⁻¹. One end is
attached to a fixed point O of a smooth horizontal plane and the other end
to a particle of mass 5 kg which is extended by a further distance of 3 m
and then released to oscillate in a straight line. After time t the extension of
the spring is x. Taking g = 10 m s⁻², find:

(i) the equation of motion of the particle;

(ii) the period of its oscillation;

(iii) x in terms of t;

(iv) the maximum speed of the particle.


2 A vertical light spring has two equal masses fixed to it at its ends, as 
shown. When the lower mass lies on a table, the spring is compressed a 
distance d by the weight of the upper mass. If the upper mass is now pushed 
down a further distance 2.5d and then released, examine whether the 
lower mass will at some time leave the table. 

(You may assume that if the lower mass were fixed, the upper mass 
would oscillate symmetrically about its equilibrium position.) 


3 A boat is tossing (moving vertically up and down) in s.h.m. so that at
one extreme position the weight of a passenger appears to be halved.

(i) What is the ratio of apparent weight to actual weight at the other
extreme?

(ii) If the total range of motion is 10 m, what is the period of a complete
oscillation?

(iii) What is the ratio of the volumes of water displaced by the boat at the
two extremes?

(iv) During what fraction of the time is the apparent weight within 10 per
cent of the real weight? (M.E.I.)


4 A particle of mass 2 kg is hung from the end of a vertical string and 
stretches it by 10 cm. Find the period of the oscillation if it is then pulled 
down a further distance of: 

(i) 5 cm; (ii) 15 cm;

and then released. (Take g = 10 m s⁻².)


5 One end O of a light elastic string OA, of natural length 4a and modulus
3mg, is attached to a fixed point of a smooth horizontal table. A particle of
mass m is attached to the other end A. The particle is pulled along the table
until it is at a distance 5a from O and is then released with the string straight.
Show that the particle first reaches O after a time


   $(\pi + 8)\sqrt{\dfrac{a}{3g}}$. (O.C.)




6 A man stands on a horizontal rough plank which performs a horizontal
simple harmonic motion of period T and amplitude a. The coefficient of
friction between the man and the plank is μ. Show that he will not slip on
the plank if


   $\mu g T^2 > 4\pi^2 a$. (O.C.)


7 Two particles of equal mass on a smooth horizontal table are connected 
by an elastic string of natural length a and modulus of elasticity equal to 
the weight of one particle. If the particles are held at rest at a distance 3a 
apart and are then released simultaneously, find the time which elapses 
before they collide. 


8 A particle P, of mass m, is fastened to one end of a light string, of natural
length a and modulus of elasticity λ, whose other end is fixed at O.
A particle Q, also of mass m, is fastened to one end of a light inextensible
string of length a whose other end is fastened to P. Initially the system is at
rest with OPQ in a vertical straight line and OP of length a, so that P
is at the middle point of OQ. The system is now released. Prove, by
considerations of energy or otherwise, that the length of OP is never less than a.
The length of OP at time t is a + x. Prove that,


   if $n^2 = \dfrac{\lambda}{2am}$, then $\dfrac{d^2x}{dt^2} + n^2 x = g$.

Show that


   $x = \dfrac{g}{n^2}(1 - \cos nt)$ and deduce the tension at time t in the string PQ.
(C.)


9 A particle of mass m hangs at rest from a light inextensible string of
length l. The particle is suddenly struck a small blow of impulse mz,
perpendicular to the string. Find the amplitude of the subsequent motion and
the time taken for the particle to rise to a height equal to one-half of its
greatest height, measured from the initial position. (M.E.I.)


10 A particle of mass m is attached to the mid-point of a stretched light
string whose tension is T and whose ends are fixed to two points distance l
apart on a smooth horizontal plane. Show that if the particle is moved a
distance x perpendicular to the string, the restoring force is 2T sin θ,
where θ is the angle which the parts of the string make with their original
line.

Hence find the equation of motion when θ is small, and the period of
oscillation.


*11.10 General statics: moments, couples, and equilibrium 


So far we have limited our study to that of a single particle. But we are often
concerned with the behaviour of collections of particles, both at rest and
in motion. These might range from the particles of a highly compressible
gas injected into a turbine to those of water, nearly incompressible,
in waves at sea. But the simplest such collection, with which we shall be
principally concerned, is one in which the distance between every pair of
particles remains constant, and this we know as a rigid body. First, however,
we must develop the idea of moment of a force.


Moments of forces 


When we considered a particular force, of known magnitude and direction, 
acting on a single particle, there was never any doubt about its line of action: 
it necessarily had to pass through the particle. But if we now consider a 
collection of particles, say a chair, it is a common experience that a par- 
ticular force can have a very different effect if its line of action is shifted: 
the reaction of a man on a chair might in one position produce stability, 
whilst if exactly the same force acts at a different position the chair might 
topple. In the case of a force acting on a single particle it was unnecessary 
to consider the turning effect of the force, because it had none. But when we 
consider a collection of particles, we must clearly study the turning effects 
of the individual forces involved. 

To take a simple example, let us suppose that a beam of length 2 m is
freely hinged at one of its ends O so that the beam is in a horizontal position,
and let us further suppose that a force of magnitude 1 000 N acts vertically
upwards at its other end.


[Figure: horizontal beam of length 2 m hinged at O, with a force of 1 000 N
acting vertically upwards at its other end.]


Now it is clear that the force of 1 000 N exerts a turning effect about O, 
and we know that this effect can be altered 
(a) by increasing, or decreasing, the magnitude of the force; 
(b) by altering its direction; 
(c) by altering the position of its line of action, perhaps by moving it 
nearer to O. 

The reader will already be familiar with a measure of such a turning 
effect called the moment about O of the given force, which is influenced by such 
changes. This is usually defined as the product of the magnitude of the
force and its perpendicular distance from O:


   moment about O of F = pF,

where p is the perpendicular distance from O to the line of action of F.

Let us now consider what will happen if we radically alter the direction
of the force. For instance suppose that instead of acting vertically, the force
remains perpendicular to the rod but acts in a horizontal direction:


[Figure: the same beam, with the force of 1 000 N now acting horizontally,
perpendicular to the beam.]


It is immediately apparent that such a change alters the turning effect 
from being in a vertical plane to being in a horizontal plane, as shown by 
the arrows in the two figures. 

We therefore see that a turning effect is defined only when we know not 
only its magnitude and point of application but also the plane in which it is 
acting. A bowler, for instance, may give equal turning effects to three 
successive balls, one with top spin, the next as an off-break and the last as a 
leg-break: their distinguishing features are the planes in which they are 
made to spin. Now each of these planes is perfectly defined by the direction 
of its normal so we see that for the turning effect to be fully defined we need 
to know both a magnitude and also a direction, that of the normal to the 
plane in which it acts. It is, therefore, no surprise that the moment of a force
should emerge not as a scalar quantity, but as a vector.


Moment as a vector product 


A force and its line of action are completely defined by: 

(i) a vector F representing the force; and

(ii) the position vector r of any point on its line of action.
Let us now consider the vector product r × F.


[Figure: the force F and the position vector r of a point on its line of
action.]


First, we notice that the vector r × F does not depend on the particular
point which was chosen on its line of action. For suppose that, instead of the
point P (with position vector r), we had chosen another point P′ (with
position vector r′).

Now PP′ is in the same direction as F.


So PP′ = λF, where λ is a scalar
=> r′ − r = λF

=> (r′ − r) × F = λF × F = 0

=> r′ × F = r × F.


Hence the vector product r × F is independent of the particular point
chosen on the line of action of F and depends only on F and its line of action.

Furthermore, if the angle between r and F is θ, we see that the magnitude
of r × F is


   rF sin θ = (r sin θ)F = pF.


Lastly, the direction of r × F is perpendicular to r and to F and so is
perpendicular to the plane containing the line of action and O.

In summary, it is evident that r × F depends only upon the vector F
and its line of action, that its magnitude is the product of F and its
perpendicular distance from O, and that its direction corresponds to a turning
effect in the plane containing F and O. We therefore now re-define the
moment about O of the force F as the vector r × F:


   moment about O of the force F = r × F


In particular, we notice that if F goes through O, its moment about O is
   0 × F = 0.
Also r × (−F) = −(r × F),


so that the moments about O of two equal and opposite forces are
themselves equal and opposite.

More generally, an extremely valuable result follows directly from the
fact that vector products are distributive with respect to addition. For
suppose that P, Q are two forces which meet in a point and that their
resultant is R. Let the position vector of their point of intersection be r.


Then r × P + r × Q = r × (P + Q) = r × R.


So the sum of the moments about any point of two concurrent forces
is equal to the moment of their resultant.
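Both properties of the moment r × F established above can be illustrated numerically. In this sketch the helper names cross and moment and all the vectors are illustrative assumptions; vectors are written as tuples of (i, j, k) components.

```python
def cross(u, v):
    """Vector product u x v of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def moment(r, F, about=(0, 0, 0)):
    """Moment about the point `about` of a force F acting through r."""
    arm = tuple(ri - ci for ri, ci in zip(r, about))
    return cross(arm, F)

# Illustrative data (not from the text).
F = (2, 1, 0)
r = (1, 3, 0)
r_other = tuple(ri + 2.5 * Fi for ri, Fi in zip(r, F))   # r + 2.5F, on the same line

print(moment(r, F), moment(r_other, F))    # the two moments agree

# Two concurrent forces P, Q through r, and their resultant R = P + Q.
P, Q = (1, 0, 2), (0, 3, -1)
R = tuple(p + q for p, q in zip(P, Q))
sum_of_moments = tuple(a + b for a, b in zip(moment(r, P), moment(r, Q)))
print(sum_of_moments == moment(r, R))      # True
```

Sliding the point of application along the line of action leaves the moment unchanged, which is why only the line of action, and not the particular point on it, matters.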


Equivalent systems of forces 


It sometimes happens that two apparently different systems of forces have: 
(i) their vector sums equal;
(ii) their moments equal about every chosen point.

In such a case we say that the two systems are equivalent. In particular, 
the resultant of two concurrent forces has the same moment about any point 
as the sum of the moments of its constituent forces. So the resultant of two 
concurrent forces is equivalent to the forces themselves. 

It can now easily be shown that two sets of forces are equivalent if: 

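(i) their vector sums are equal, and
(ii) their moments are equal about any one particular point.
For let us take this point as origin and let the two sets be


   $\mathbf{F}_i$ at $\mathbf{r}_i$ (i = 1, 2, ..., n)
and $\mathbf{F}'_j$ at $\mathbf{r}'_j$ (j = 1, 2, ..., m).


Then $\sum \mathbf{F}_i = \sum \mathbf{F}'_j$ and $\sum \mathbf{r}_i \times \mathbf{F}_i = \sum \mathbf{r}'_j \times \mathbf{F}'_j$.

So the moment of the first set of forces about another point c
   $= \sum (\mathbf{r}_i - \mathbf{c}) \times \mathbf{F}_i$

   $= \sum \mathbf{r}_i \times \mathbf{F}_i - \mathbf{c} \times \sum \mathbf{F}_i$

   $= \sum \mathbf{r}'_j \times \mathbf{F}'_j - \mathbf{c} \times \sum \mathbf{F}'_j$

   $= \sum (\mathbf{r}'_j - \mathbf{c}) \times \mathbf{F}'_j$


   = moment of the second set of forces about c.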
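The test for equivalence just proved (equal vector sums, and equal moments about one point) can be sketched as follows. The function names, the tolerance and the sample systems are assumptions made for illustration; each system is a list of (force, position) pairs.

```python
def cross(u, v):
    """Vector product u x v of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def total_force(system):
    """Vector sum of the forces in a list of (force, position) pairs."""
    return tuple(sum(F[i] for F, _ in system) for i in range(3))

def total_moment(system, about=(0, 0, 0)):
    """Sum of the moments of all the forces about the point `about`."""
    total = (0, 0, 0)
    for F, r in system:
        arm = tuple(ri - ci for ri, ci in zip(r, about))
        total = tuple(t + m for t, m in zip(total, cross(arm, F)))
    return total

def are_equivalent(system_a, system_b, tol=1e-9):
    """Equal vector sums and equal moments about the origin."""
    same_sum = all(abs(p - q) <= tol
                   for p, q in zip(total_force(system_a), total_force(system_b)))
    same_moment = all(abs(p - q) <= tol
                      for p, q in zip(total_moment(system_a), total_moment(system_b)))
    return same_sum and same_moment

# Illustrative systems: one force, and a pair of parallel forces.
single = [((0, 3, 0), (2, 0, 0))]
pair = [((0, 1, 0), (0, 0, 0)), ((0, 2, 0), (3, 0, 0))]
print(are_equivalent(single, pair))                        # True

c = (4, -1, 7)                                             # any other point
print(total_moment(single, c) == total_moment(pair, c))    # True
```

The second check illustrates the result of the proof: once the sums and the moments about one point agree, the moments about every other point agree as well.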


Parallel forces 


The question now arises whether we can find such an equivalent force, or 
resultant, when the two forces are not concurrent but parallel. 

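Let us consider two parallel forces $k_1\mathbf{F}$ and $k_2\mathbf{F}$ acting through points
$\mathbf{r}_1$ and $\mathbf{r}_2$.


Now if a single force is equivalent to the original pair, it must be


   $k_1\mathbf{F} + k_2\mathbf{F} = (k_1 + k_2)\mathbf{F}$.


[Figure: the parallel forces $k_1\mathbf{F}$ at $\mathbf{r}_1$ and $k_2\mathbf{F}$ at $\mathbf{r}_2$, with the force
$(k_1 + k_2)\mathbf{F}$ acting between them.]


We now take a point r on the line of action of this force. Since it is
equivalent to the original forces, we know that
   $\mathbf{r} \times (k_1 + k_2)\mathbf{F} = \mathbf{r}_1 \times k_1\mathbf{F} + \mathbf{r}_2 \times k_2\mathbf{F}$
⇔ $(k_1 + k_2)\,\mathbf{r} \times \mathbf{F} = (k_1\mathbf{r}_1 + k_2\mathbf{r}_2) \times \mathbf{F}$.
In particular, we see that this equation is true if
   $(k_1 + k_2)\,\mathbf{r} = k_1\mathbf{r}_1 + k_2\mathbf{r}_2$
⇔ $\mathbf{r} = \dfrac{k_1\mathbf{r}_1 + k_2\mathbf{r}_2}{k_1 + k_2}$,


which is the point dividing the line joining $\mathbf{r}_1$ and $\mathbf{r}_2$ in the ratio $k_2 : k_1$.

As O could have been taken at any point whatsoever, it is clear that the
original forces $k_1\mathbf{F}$ and $k_2\mathbf{F}$ are equivalent to a force $(k_1 + k_2)\mathbf{F}$ acting
along a line which separates their lines of action in the ratio $k_2 : k_1$. So this
force is again known as their resultant.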
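The position of the resultant of two parallel forces can be computed directly from the formula above. In this sketch the function name parallel_resultant and the numerical values are illustrative; the function assumes k1 + k2 is not zero, since the formula for r involves dividing by that sum.

```python
def cross(u, v):
    """Vector product u x v of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def parallel_resultant(k1, r1, k2, r2):
    """Return (k1 + k2, r), where r is a point on the resultant's line of action."""
    total = k1 + k2
    if total == 0:
        raise ValueError("k1 + k2 = 0: the formula for r cannot be used")
    r = tuple((k1 * a + k2 * b) / total for a, b in zip(r1, r2))
    return total, r

# Illustrative data: forces 2F at the origin and 3F at (5, 0, 0), with F = j.
k1, r1 = 2, (0, 0, 0)
k2, r2 = 3, (5, 0, 0)
F = (0, 1, 0)

total, r = parallel_resultant(k1, r1, k2, r2)
print(total, r)      # 5 (3.0, 0.0, 0.0): 3 units from r1, 2 units from r2

# Check that the moments about O agree.
moment_of_resultant = cross(r, tuple(total * f for f in F))
moment_of_pair = tuple(p + q for p, q in zip(cross(r1, tuple(k1 * f for f in F)),
                                             cross(r2, tuple(k2 * f for f in F))))
print(moment_of_resultant, moment_of_pair)
```

The resultant lies nearer to the larger force, dividing the join of the two points in the ratio k2 : k1.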


Centre of gravity 


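We now suppose that a system of particles consists of masses $m_1, m_2, m_3, \ldots$
at points $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3, \ldots$.


Then these masses have weights $m_1 g, m_2 g, m_3 g, \ldots$


acting at points $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3, \ldots$.


[Figure: the weights $m_1 g$, $m_2 g$, $m_3 g$ acting at the points $\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$, with
origin O.]

Hence, by a simple extension of the above result, we see that they are
equivalent to a weight of $(m_1 + m_2 + m_3 + \cdots)g$ acting at the point


   $\dfrac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2 + m_3\mathbf{r}_3 + \cdots}{m_1 + m_2 + m_3 + \cdots}$,


which we have already met as the centre of mass.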

Hence the weights of the separate particles are equivalent to their total 
weight acting through the centre of mass, which is therefore also called the 
centre of gravity of the system. 


Couples 


There is, however, one exceptional case of two parallel forces, when
$k_1 = -k_2$ and the forces are equal and opposite with different lines of
action. Such a system is called a couple.
We now consider a couple consisting of two forces, $\mathbf{F}$ at $\mathbf{r}_1$ and $-\mathbf{F}$ at $\mathbf{r}_2$.
If a point P is taken with position vector p, the moment of the couple
about P is


   $(\mathbf{r}_1 - \mathbf{p}) \times \mathbf{F} + (\mathbf{r}_2 - \mathbf{p}) \times (-\mathbf{F})$
   $= \mathbf{r}_1 \times \mathbf{F} - \mathbf{p} \times \mathbf{F} - \mathbf{r}_2 \times \mathbf{F} + \mathbf{p} \times \mathbf{F}$
   $= (\mathbf{r}_1 - \mathbf{r}_2) \times \mathbf{F}$.

[Figure: the couple formed by F and −F, with the point P at position
vector p.]


As this is independent of the position of P, it is simply called the moment of
the couple. Note that the moment of a single force about a point on its line
of action is zero, so a couple cannot possibly be equivalent to a single force
and is therefore said to be irreducible.
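That the moment of a couple does not depend on the point P can be illustrated by evaluating it about several points. The names and numbers in this sketch are illustrative assumptions only.

```python
def cross(u, v):
    """Vector product u x v of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def couple_moment(F, r1, r2, about):
    """Total moment about `about` of F acting at r1 and -F acting at r2."""
    minus_F = tuple(-f for f in F)
    arm1 = tuple(a - p for a, p in zip(r1, about))
    arm2 = tuple(a - p for a, p in zip(r2, about))
    return tuple(m1 + m2 for m1, m2 in zip(cross(arm1, F), cross(arm2, minus_F)))

# Illustrative couple: F = 4j at (3, 0, 0) and -F at (1, 0, 0).
F, r1, r2 = (0, 4, 0), (3, 0, 0), (1, 0, 0)
for p in [(0, 0, 0), (10, -7, 2), (-3, 5, 1)]:
    print(p, couple_moment(F, r1, r2, p))      # (0, 0, 8) each time
```

In every case the result equals (r1 − r2) × F, here 8k.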


Example 1 

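Coplanar forces $\mathbf{F}_1, \mathbf{F}_2, \mathbf{F}_3$ act through points whose position vectors are
$\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3$ respectively and

   $\mathbf{F}_1 = 4\mathbf{i} + 3\mathbf{j}$,   $\mathbf{r}_1 = 3\mathbf{i} + 2\mathbf{j}$,

   $\mathbf{F}_2 = 5\mathbf{i} - 8\mathbf{j}$,   $\mathbf{r}_2 = -\mathbf{i} + \mathbf{j}$,

   $\mathbf{F}_3 = -2\mathbf{i} + 6\mathbf{j}$,  $\mathbf{r}_3 = 2\mathbf{i} - \mathbf{j}$.

(i) Show that these forces reduce to a single force and give its magnitude
and direction.

(ii) Find a vector equation or the Cartesian equation of the line of action
of the resultant.


(iii) Find the single force through the point r = i + j and the couple
which are together equivalent to this system. (W.)


(i) If these forces reduce to a single force, it must be

   $\mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 = 7\mathbf{i} + \mathbf{j}$,

and so have magnitude √50 and be in the direction making tan⁻¹(1/7) ≈ 8° 8′
with Ox.

(ii) If the line of action of its resultant passes through r = xi + yj,


then $\mathbf{r} \times (\mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3) = \mathbf{r}_1 \times \mathbf{F}_1 + \mathbf{r}_2 \times \mathbf{F}_2 + \mathbf{r}_3 \times \mathbf{F}_3$
=> $(x\mathbf{i} + y\mathbf{j}) \times (7\mathbf{i} + \mathbf{j}) = (3\mathbf{i} + 2\mathbf{j}) \times (4\mathbf{i} + 3\mathbf{j})$
      $+ (-\mathbf{i} + \mathbf{j}) \times (5\mathbf{i} - 8\mathbf{j})$
      $+ (2\mathbf{i} - \mathbf{j}) \times (-2\mathbf{i} + 6\mathbf{j})$
=> $(x - 7y)\mathbf{k} = (1 + 3 + 10)\mathbf{k} = 14\mathbf{k}$
=> x − 7y = 14.


(iii) Let the required force be F and the required couple be G.
Again, $\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 = 7\mathbf{i} + \mathbf{j}$.
Furthermore,
   $\mathbf{r} \times (\mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3) + \mathbf{G} = \mathbf{r}_1 \times \mathbf{F}_1 + \mathbf{r}_2 \times \mathbf{F}_2 + \mathbf{r}_3 \times \mathbf{F}_3$
=> $(\mathbf{i} + \mathbf{j}) \times (7\mathbf{i} + \mathbf{j}) + \mathbf{G} = 14\mathbf{k}$
=> $-6\mathbf{k} + \mathbf{G} = 14\mathbf{k}$
=> $\mathbf{G} = 20\mathbf{k}$.


So required force is 7i + j and required couple is 20k.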
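The working of Example 1 can be checked with a short calculation. Since all the vectors lie in the plane, only the k-component of each vector product is needed; the helper name cross2 and the variable names are illustrative.

```python
import math

def cross2(r, F):
    """k-component of r x F for two vectors (x, y) in the plane."""
    return r[0] * F[1] - r[1] * F[0]

# (force, position vector) pairs from Example 1, as (i, j) components.
system = [((4, 3), (3, 2)), ((5, -8), (-1, 1)), ((-2, 6), (2, -1))]

resultant = (sum(F[0] for F, _ in system), sum(F[1] for F, _ in system))
moment_about_O = sum(cross2(r, F) for F, r in system)

magnitude = math.hypot(*resultant)
angle_deg = math.degrees(math.atan2(resultant[1], resultant[0]))

# Line of action: points (x, y) with x*Ry - y*Rx = moment_about_O.
print(resultant, moment_about_O)      # (7, 1) 14, i.e. x - 7y = 14
print(magnitude, angle_deg)

# Part (iii): move the resultant to r = i + j and add a balancing couple G.
couple = moment_about_O - cross2((1, 1), resultant)
print(couple)                         # 20
```

The angle printed is in decimal degrees, about 8.13°, which corresponds to the 8° 8′ quoted above.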


Exercise 11.10a


1 Find the moments: 


About the point, Of the force, Acting at 
(2) O t+y-—k i—k 
(1) O t+ 7 -- 2k ὁ Ὁ 2) - Κα 
(111) ὦ Ὁ 7 t~+23—k t+y—k 
(10) ἃ 27 — 3k δὲ Ὁ 2) -- 3k 33 +k 
(v) ἃ -- 7) -2ὲ Ὁ 37+ k j + 2k 
(7) 2+y7—k —t+4--—k =f ga 2k 


2 Determine in each of the following cases whether the two sets of forces 
are equivalent: 

(1) ἑ δοῦηρ δι 7 and 7 acting at t, 7 + 7 acting at 27; 

(1) —tacting atyj andj acting at 4, —1 + 7 acting at 27; 
(111) 27 acting at O andj acting at 34, 37 acting at 7; 
(w) 42 acting at O and —? acting at — 32, 3ὲ acting at — 47; 

(v) the couple formed by —¢atj and ἑ at 2), and that formed by —j at 24 
and 7 at 2; 

(vz) the couple formed by —#atj and ἑ at -- 7, and that formed by —j ata 
and 7 at k. 


3 Show that each of the following systems of coplanar forces is equivalent
either to a single force or a single couple (which may be zero). In each case 
find the equivalent force (including its line of action) or the equivalent 
couple: 

(2) t+ yate— 7, 3ὲ — zat 22, 2ὲ + 37 at 27; 

(1) t+gati—y, —3¢+y7ati+ 7, 2ὲ — Wat --ἰ + Q; 

(11) Qjatt — 7, —taty, iat 37, —2j at O; 

(w) 2¢+ 3patz, --ϑὲ + 47 αἱ, 404+ 57 αἱὲ - 7, --ἰ + 3} αἱ —i — 27. 
4 ABCD is a square of side a. The anti-clockwise moment of a set of forces
in the plane of the square is 10aP about A, −20aP about B and 10aP about
C. Taking the x-axis along AB and the y-axis along AD, determine the
resultant of the set of forces and the equation of its line of action.
Determine the couple and the single force at the centre of the square
which would be equivalent to this resultant.


5 (i) Show that three forces represented in magnitude and direction by
kAB, kBC, kCA and acting along the sides AB, BC, CA respectively of a
triangle are equivalent to a couple of magnitude 2k times the area of the
triangle.

(ii) Use the result (i) to show that four forces represented in magnitude
and direction by kAB, kBC, kCD, kDA and acting along the sides AB, BC,
CD, DA respectively of a plane quadrilateral are equivalent to a couple of
magnitude 2k times the area of the quadrilateral. (M.E.I.)


Statics of a system: equilibrium 


Any system of particles (which may, or may not, comprise a rigid body) is 
said to be in equilibrium if, under the forces acting, it can remain at rest. 
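Let us consider a system in equilibrium which consists of just three
particles of masses $m_1, m_2, m_3$ whose position vectors are $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3$.
The forces acting on these particles can be of two kinds:
(i) external forces, which we call $\mathbf{F}_1, \mathbf{F}_2, \mathbf{F}_3$, acting on the particles
$m_1, m_2, m_3$ respectively; and
(ii) internal forces, between the pairs of particles. We let the force exerted
upon $m_1$ by $m_2$ be $\mathbf{F}_{12}$, and the force exerted upon $m_2$ by $m_1$ be $\mathbf{F}_{21}$, etc.
So the complete set of forces can be represented:


[Figure: the three particles, with the external forces $\mathbf{F}_1, \mathbf{F}_2, \mathbf{F}_3$ and the
internal forces $\mathbf{F}_{12}, \mathbf{F}_{21}$, etc. acting between each pair.]


Now, by Newton's third law, every pair of internal forces must be equal
and opposite.


So $\mathbf{F}_{12} + \mathbf{F}_{21} = \mathbf{0}$.

Moreover, since these forces have the same line of action, their moments
about O must be equal and opposite, and


   $\mathbf{r}_1 \times \mathbf{F}_{12} + \mathbf{r}_2 \times \mathbf{F}_{21} = \mathbf{0}$.


If we now look at the equilibrium of the particles separately, we see that


   $\mathbf{F}_1 + \mathbf{F}_{12} + \mathbf{F}_{13} = \mathbf{0}$
   $\mathbf{F}_2 + \mathbf{F}_{23} + \mathbf{F}_{21} = \mathbf{0}$
   $\mathbf{F}_3 + \mathbf{F}_{31} + \mathbf{F}_{32} = \mathbf{0}$.


We now add these equations and remember that
   $\mathbf{F}_{12} + \mathbf{F}_{21} = \mathbf{0}$, etc.,

so that

   $\mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 = \mathbf{0}$.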

Hence if a system is in equilibrium the sum of the external forces must 
be zero, just as when they act upon a single particle. 


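Moreover, we can also take the vector products of each of the above
expressions with $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3$ respectively, and obtain


   $\mathbf{r}_1 \times \mathbf{F}_1 + \mathbf{r}_1 \times \mathbf{F}_{12} + \mathbf{r}_1 \times \mathbf{F}_{13} = \mathbf{0}$
   $\mathbf{r}_2 \times \mathbf{F}_2 + \mathbf{r}_2 \times \mathbf{F}_{23} + \mathbf{r}_2 \times \mathbf{F}_{21} = \mathbf{0}$
   $\mathbf{r}_3 \times \mathbf{F}_3 + \mathbf{r}_3 \times \mathbf{F}_{31} + \mathbf{r}_3 \times \mathbf{F}_{32} = \mathbf{0}$.


We now add these three equations and remember that


   $\mathbf{r}_1 \times \mathbf{F}_{12} + \mathbf{r}_2 \times \mathbf{F}_{21} = \mathbf{0}$
   (and $\mathbf{r}_1 \times \mathbf{F}_{13} + \mathbf{r}_3 \times \mathbf{F}_{31} = \mathbf{0}$,
   $\mathbf{r}_2 \times \mathbf{F}_{23} + \mathbf{r}_3 \times \mathbf{F}_{32} = \mathbf{0}$).


Hence $\mathbf{r}_1 \times \mathbf{F}_1 + \mathbf{r}_2 \times \mathbf{F}_2 + \mathbf{r}_3 \times \mathbf{F}_3 = \mathbf{0}$.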


So the sum of the moments about O of the external forces must be zero. 


Now from the start of this section, O has been any fixed point which it is 
wished to use as an origin. It therefore follows that the sum of the moments 
of the external forces about any point must be zero.

For simplicity, we limited ourselves to the consideration of just three
particles. But the whole of the above analysis can immediately be extended
to cover any number, finite or infinite. Summarising, we can see that:


For any system of particles to be in equilibrium:
(i) the sum of the external forces must be zero; and


(ii) the sum of the moments of the external forces about any fixed
point must also be zero.


Example 2 


A uniform beam of length 10 m and weight 2 000 N has a man of weight 
600 N standing on one end. If the beam is supported at two points 2 m 
from each end, find the reactions at these supports. 


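[Figure: the beam on its supports B and C, with the man's weight of 600 N at
the end nearer B, the beam's weight of 2 000 N at its mid-point, and the
reactions R at B and S at C.]


Letting the two reactions (in newtons) be R, S and taking moments
about C,


   6R = 8 × 600 + 3 × 2 000 = 10 800
=> R = 1 800.


Also, taking moments about B,


   2 × 600 + 6S = 3 × 2 000
=> 6S = 4 800
=> S = 800.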


So the two reactions are 1 800 N and 800 N (and we note, as we would 
expect, that R + S = 600 + 2 000). 
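The method of Example 2 (moments about one support, then resolving vertically) can be written as a small function. This is a sketch for a uniform horizontal beam on two supports; the function name support_reactions and its argument names are illustrative, and positions are measured from the end on which the man stands.

```python
def support_reactions(length, beam_weight, pos_b, pos_c, loads):
    """Reactions (R at B, S at C) for a uniform horizontal beam.

    Positions are measured from one end; `loads` is a list of
    (weight, position) pairs for extra point loads.
    """
    # Moments about C of the weights, balanced by R acting at B.
    moment_about_c = beam_weight * (pos_c - length / 2)
    moment_about_c += sum(w * (pos_c - x) for w, x in loads)
    R = moment_about_c / (pos_c - pos_b)
    # Resolving vertically gives the other reaction.
    S = beam_weight + sum(w for w, _ in loads) - R
    return R, S

# Data of Example 2: supports 2 m from each end, man standing at one end.
print(support_reactions(10, 2000, 2, 8, [(600, 0)]))   # (1800.0, 800.0)
```

A negative value for either reaction would indicate that the beam is about to tilt about the other support.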


Example 3 


A lightweight ladder of length 8 m is inclined at 70° to the horizontal,
resting upon a rough horizontal path and a smooth vertical wall. If a man
climbs this ladder and it begins to slip when he is three-quarters of the way
up, find the coefficient of friction at its lower end.

If we neglect the weight of the ladder itself, there are just three external
forces acting on the system of the man and ladder:

(i) W, the weight of the man;

(ii) S, the normal reaction at the smooth wall;

(iii) R, the reaction at the ground which (on account of a frictional
component) can act at an inclination to the normal.

Now when the system is in equilibrium these three forces must have
zero total moment about every point. But S and W clearly have zero moment
about their point of intersection, so R must also pass through this point.

Hence as the man gradually climbs the ladder, the reaction R makes an
increasing angle with the vertical; until, finally, when the man is
three-quarters of the way up, this angle becomes λ, the angle of friction, and the
ladder starts to slip.

If we now look more closely at this limiting position, we see that (in the
notation of the figure)


    h = 8 sin 70°
and x = 6 cos 70°


=> $\tan\lambda = \dfrac{x}{h} = \dfrac{6\cos 70^\circ}{8\sin 70^\circ}$
   = ¾ cot 70°
   = ¾ × 0.36
   = 0.27.


So μ = tan λ = 0.27.

Alternatively, we can let the components of R be F and N, and then
proceed as follows:


Resolving vertically, N = W.
Taking moments about the point P,


   F × 8 sin 70° = W × 6 cos 70°

=> $F = \dfrac{3W}{4}\cot 70^\circ$

=> $\dfrac{F}{N} = \dfrac{F}{W} = \tfrac{3}{4}\cot 70^\circ = 0.27$.


Hence, if slipping takes place at this point, μ = 0.27.
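The result of Example 3 generalises easily: for a light ladder against a smooth wall, the length of the ladder cancels and the limiting coefficient of friction depends only on the inclination and on how far up the climber has gone. The function below is a sketch under those assumptions; its name and arguments are illustrative.

```python
import math

def limiting_mu(angle_deg, fraction_up):
    """Least coefficient of friction at the foot of a light ladder resting on
    a smooth wall, when the climber is `fraction_up` of the way up."""
    return fraction_up / math.tan(math.radians(angle_deg))

print(round(limiting_mu(70, 0.75), 2))   # 0.27
```

Setting fraction_up to 1 gives the friction needed for the climber to reach the top, here cot 70°.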


Exercise 11.10b 


1 A beam AB, of length 22 m and mass 150 kg, has its centre of gravity 
at C, 11.5 m from A. It is supported in a horizontal position by two trestles 
at D and E, where AD = 7.5 m and AE = 15 m. Find the pressure on each 
trestle. (Take g = 10 m s⁻².) 

Find the mass of the heaviest man who can sit at either end of the beam 
without tilting it. (O.C.) 

2 A uniform rod of length 1 m and mass 50 g is freely hinged at A and has 
its other end resting on a smooth peg B. The rod in this position makes an 
angle of 30° with the horizontal. 

Calculate: 
(i) the reaction at B; 
(ii) the horizontal component of the reaction at A. 
(Take g = 9.81 m s⁻².) (C.) 


3 A uniform beam ABCD, where AB = a, BC = 3a and CD = a, is of 
weight 4W and rests horizontally in equilibrium on supports at B and C. 
Loads of weight xW, yW are hung from A, D respectively. Calculate, in 
terms of x, y and W, the load borne by each support. If the sum of the loads 
borne by both supports cannot exceed 20W, indicate by shading on the 
plane of the coordinate axes Oxy the region within which the point (x, y) 
must lie. Deduce that the greatest value of y is 14. (O.C.) 
4 A rod AB, hinged at A, is 4 m long and of mass 5 kg, with its centre of 
mass 3 m from A. A rope is attached to B, passes over a smooth pulley 3 m 
above A and supports a mass M hanging freely. A mass m hangs from B 
and keeps AB in a horizontal position. 

(i) When m = 3, find M; and, 

(ii) when M = 20, find m. 


5 A non-uniform rod AB, of length a and weight W, rests making an 
angle β with the horizontal with its lower end B on a rough horizontal 
table. The centre of mass of the rod is at G where AG = 3a. Equilibrium is 
maintained by a horizontal string attached to A. Show that the coefficient 
of friction between the rod and the table cannot be less than ξ cot B. Find 
the tension in the string. (O.C.) 


6 A uniform beam AB rests in limiting equilibrium with A against a 
rough vertical wall and with B on rough horizontal ground. The vertical 
plane through the beam is perpendicular to the wall. The coefficient of 
friction at A is + and the coefficient of friction at B is 5. Calculate, correct to 
the nearest degree, the inclination of the beam to the horizontal. (O.C.) 


7 A uniform ladder is leaning against a vertical wall on horizontal ground. 
The coefficients of friction at the wall and at the ground are equal, and the 
ladder is on the point of slipping. Show that the total reaction at the wall 
must be at right angles to the total reaction at the ground. 

Draw a figure with the ladder inclined at 60° to the ground; by means of 
a circle with the ladder as diameter, mark on the figure the point where the 
two reactions intersect. (0.C.) 


8 A circular cylinder of weight W is held with its axis horizontal on a 
rough plane inclined at 30° to the horizontal by a cord wrapped round it, 
one end of the cord being fixed to the cylinder, the other leading away 
tangentially, at right angles to the axis of the cylinder. Determine the 
tension of the cord and the least possible coefficient of friction between the 
cylinder and the plane 

(i) if the cord leads away horizontally; 

(ii) if it leads away vertically upwards; 

(iii) if it leads away in such a direction as to make the tension a minimum. 


9 A uniform hemispherical shell rests with its curved surface against a 
rough vertical wall and a rough horizontal floor. If the equilibrium is 
limiting at both points of contact when the plane of the base of the shell 
makes an angle θ with the horizontal, prove that 

$\sin\theta = \dfrac{2\mu(1+\mu)}{1+\mu^2},$

where μ is the coefficient of friction at both points of contact. 
Prove that such a position is impossible if μ > √2 − 1. 
(The centre of mass of the shell bisects the radius of symmetry.) (O.C.) 
10 The mass per unit length of a rod AB of length l varies linearly from 
m at the end A to 2m at the end B. Prove by means of the integral calculus 
that the total mass of the rod is 3ml/2 and find the position of the centre of 
gravity. 

An inextensible string of length a is attached to a point P of the rod at a 
distance a (< l) from the end A. The other end of the string is fixed to the 
point C on a rough vertical wall and the end A of the rod rests against this 
wall so that A is below C and AC = a. Find the tension in the string and 
the frictional force at the point A, and show that this vanishes if l = 18a/5. 

(M.E.I.) 


11 A uniform rod AB is suspended from a fixed point O of a rough vertical 
wall by a light inextensible string of length l which is fastened to B. The end 
A of the rod rests against the wall vertically below O with A below the 
level of B. If the rod is in equilibrium (not necessarily limiting) with 
OA = d and angle AOB = θ, find the tangent of the angle of inclination 
to the horizontal of the resultant reaction on the rod at A. 

If the angle of friction at A is λ and equilibrium is limiting, with A about 
to slide downwards, show that 

$\cos(\theta - \lambda) = \dfrac{2d}{l}\cos\lambda.$


12 O, A, B, C are the four corners of a square lamina OABC of side a. 
Forces act in the plane of the lamina as follows: 6P along AO, P along OC, 
2P along BC and 7P along BA. 

(i) Find the magnitude of the resultant of this system of forces, and the 
equation of its line of action referred to OA, OC as x- and y-axes. 

(ii) It is desired to maintain equilibrium by applying a force and a couple 
at one corner of the square. Find which corner should be selected so that 
the moment of the couple is as small as possible, and state the moment of 
this least couple. 

(iii) Find the couple which must be applied to the lamina so that the 
resultant of the four original forces and the couple acts in a line passing 
through the centre of the square. (M.E.I.) 


*11.11 General dynamics and rotation about a fixed axis 
Dynamics of a system of particles 


Suppose that a system of masses is in motion and that a typical mass $m_i$ at 
point $\mathbf{r}_i$ is under the action of an external force $\mathbf{F}_i$ and an internal force 
$\mathbf{I}_i$. 

Then, by Newton’s second law, 

$\mathbf{F}_i + \mathbf{I}_i = m_i\ddot{\mathbf{r}}_i.$
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Adding these equations over all the particles of the system, 


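$\sum \mathbf{F}_i + \sum \mathbf{I}_i = \sum m_i\ddot{\mathbf{r}}_i. \qquad (1)$

But, applying Newton’s third law, 

$\sum \mathbf{I}_i = \mathbf{0}$

and if the total mass of the system is $M$ ($= \sum m_i$) and the centre of mass is 
at $\bar{\mathbf{r}}$, then 

$M\bar{\mathbf{r}} = \sum m_i\mathbf{r}_i$
$\Rightarrow M\ddot{\bar{\mathbf{r}}} = \sum m_i\ddot{\mathbf{r}}_i.$

Hence equation (1) becomes 
$\sum \mathbf{F}_i = M\ddot{\bar{\mathbf{r}}},$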


so that the centre of mass moves as though all the mass and all the external 
forces were concentrated upon it. 

When, for example, a diving champion makes a spectacular dive from a 
high board, his movement may be a complicated mixture of twists and 
turns and somersaults, and his body is certainly far from rigid. Nevertheless, 
the only external forces are the weights of the various particles that 
make up his body, and we see from the above result that his centre of mass 
moves as though all these weights and all his mass were concentrated there: 
in other words, his centre of mass (which might itself be varying relative to 
his body as he curls up, ‘jack-knifes’, or stretches out) is bound to move in a 
perfect parabola. 
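
This result can be illustrated numerically. The sketch below is only a model, under stated assumptions: the "body" is reduced to two particles joined by a light spring (standing for the internal forces), air resistance is ignored, and every numerical value (masses, stiffness, initial positions and velocities) is hypothetical. It steps the particles forward with the velocity-Verlet scheme and compares their centre of mass with the parabola of a single projectile.

```python
# Two particles joined by a light spring (an internal force) fall under gravity.
# All numerical values below are hypothetical, chosen only for illustration.
G = (0.0, -9.81)                      # acceleration due to gravity, m s^-2
K, NATURAL = 50.0, 1.0                # spring stiffness (N/m) and natural length (m)
M1, M2 = 2.0, 3.0                     # masses, kg
P1, P2 = (0.0, 0.0), (1.5, 0.0)       # initial positions, m
V1, V2 = (1.0, 4.0), (3.0, -1.0)      # initial velocities, m s^-1


def accelerations(p1, p2):
    """Accelerations of the two particles: spring force plus weight."""
    dx, dy = p2[0] - p1[0], p2[1] - p1[1]
    dist = (dx * dx + dy * dy) ** 0.5
    pull = K * (dist - NATURAL) / dist
    fx, fy = pull * dx, pull * dy     # force on particle 1; particle 2 feels the opposite
    return ((fx / M1 + G[0], fy / M1 + G[1]),
            (-fx / M2 + G[0], -fy / M2 + G[1]))


def centre_of_mass(p1, p2):
    """Mass-weighted mean of two vectors (positions or velocities)."""
    return tuple((M1 * a + M2 * b) / (M1 + M2) for a, b in zip(p1, p2))


def simulate(t_end, dt=1e-3):
    """Step both particles by velocity-Verlet; return the final centre of mass."""
    p1, p2, v1, v2 = P1, P2, V1, V2
    a1, a2 = accelerations(p1, p2)
    for _ in range(round(t_end / dt)):
        p1 = tuple(p + v * dt + 0.5 * a * dt * dt for p, v, a in zip(p1, v1, a1))
        p2 = tuple(p + v * dt + 0.5 * a * dt * dt for p, v, a in zip(p2, v2, a2))
        n1, n2 = accelerations(p1, p2)
        v1 = tuple(v + 0.5 * (a + n) * dt for v, a, n in zip(v1, a1, n1))
        v2 = tuple(v + 0.5 * (a + n) * dt for v, a, n in zip(v2, a2, n2))
        a1, a2 = n1, n2
    return centre_of_mass(p1, p2)


def parabola(t):
    """Position at time t of one projectile launched from the initial centre of mass."""
    r0 = centre_of_mass(P1, P2)
    v0 = centre_of_mass(V1, V2)
    return tuple(r + v * t + 0.5 * g * t * t for r, v, g in zip(r0, v0, G))


simulated = simulate(1.0)
expected = parabola(1.0)
print(simulated, expected)
```

However the two particles vibrate and tumble about one another, the two printed positions should agree to within rounding error, because the spring forces cancel in pairs when the equations of motion are added.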


Exercise 11.11a 


1 A girl standing on perfectly smooth ice loses her balance. What is the 
path of her centre of mass? If a friend helped to save her as she fell, what 
would be the path of their joint centre of mass? 


2 Find the position vector of the centre of mass of particles of masses 
4, 3, 2, 3 units at rest at the points 
ἐ-τ 7, 2—y, 2+, 24+ 3 
respectively. If each mass is acted upon by a force directed towards the 


origin and proportional to its distance from the origin, find the direction of 
the initial acceleration of the centre of mass. (L.) 


Rotation about a fixed axis : moments of inertia 


It is beyond our present scope to investigate further the general motion of a 
system of particles, and we must restrict ourselves to the special case of 
a rigid body which is rotating about a fixed axis: 
Suppose that a rigid body is rotating with angular velocity ω about a fixed 
axis and that a typical particle of mass m is at a perpendicular distance r 
from this axis. 

Then the speed of m is rω and its kinetic energy is $\tfrac12 m r^2\omega^2$. 

So the total kinetic energy of the body is $\sum \tfrac12 m r^2\omega^2 = \tfrac12\left(\sum m r^2\right)\omega^2$. 

Now the expression $\sum m r^2$ clearly depends only on the masses m and the 
way in which they are distributed in the body relative to the given axis. 
It is usually denoted by I, called the moment of inertia of the body about the 
given axis. 

So k.e. $= \tfrac12 I\omega^2$, where $I = \sum m r^2$. 

Comparing the two expressions 
k.e. $= \tfrac12 m v^2$ and k.e. $= \tfrac12 I\omega^2$, 

it might be expected that I will, for rotational motion, play a part very 
similar to that taken by mass for linear motion; and just as mass can be 
regarded as a measure of a particle’s inertia, or reluctance to accelerate 
when under the action of a given force, so the moment of inertia of a body 
about a given axis is a measure of its reluctance to accelerate rotationally 
when under the action of a given moment. 
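
As a small check of the relation between kinetic energy and moment of inertia, the sketch below sums ½mv² particle by particle and compares it with ½Iω². The three particles and the angular velocity are hypothetical values chosen only for illustration.

```python
def moment_of_inertia(particles):
    """I = sum of m * r**2 over (mass, distance-from-axis) pairs."""
    return sum(m * r * r for m, r in particles)


def kinetic_energy_direct(particles, omega):
    """Sum of (1/2) m v**2, where each particle has speed v = r * omega."""
    return sum(0.5 * m * (r * omega) ** 2 for m, r in particles)


particles = [(2.0, 0.5), (1.0, 1.5), (4.0, 0.25)]  # hypothetical (kg, m) pairs
omega = 3.0                                        # hypothetical, rad s^-1

I = moment_of_inertia(particles)
ke_from_I = 0.5 * I * omega ** 2
ke_direct = kinetic_energy_direct(particles, omega)
print(I, ke_from_I, ke_direct)  # 3.0 13.5 13.5
```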

Our first task, however, is to calculate the moments of inertia of a number 
of different rigid bodies. 


Circular hoop rotating about a perpendicular axis through its centre 


Suppose that a circular hoop of mass M and radius a is rotating about a 
perpendicular axis through its centre. 
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If a typical particle has mass m, 
$I = \sum m r^2.$

But, for every particle, r = a; 
so $I = \sum m a^2 = \left(\sum m\right)a^2$

$\Rightarrow I = Ma^2.$


Uniform circular disc, about a perpendicular axis through its centre 


Suppose again that the body has mass M and radius a, and also that its 
surface density is σ. 

We begin by dividing the disc into a series of concentric hoops, and 
suppose that a typical hoop has radius r and thickness δr. 

Then area of hoop $= 2\pi r\,\delta r$
and its mass $= 2\pi\sigma r\,\delta r$.
Hence m.i. of hoop $= (2\pi\sigma r\,\delta r)r^2 = 2\pi\sigma r^3\,\delta r$
and m.i. of disc $= \lim \sum 2\pi\sigma r^3\,\delta r$

$= \int_0^a 2\pi\sigma r^3\,dr = \left[\tfrac12\pi\sigma r^4\right]_0^a = \tfrac12\pi\sigma a^4.$

But $M = \sigma \times \pi a^2 = \pi\sigma a^2.$

So m.i. $= \tfrac12 Ma^2.$
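
The limiting sum over hoops can be imitated directly on a computer. The sketch below divides a disc into a large but finite number of thin hoops and adds their moments of inertia; the mass and radius used (200 kg and 0.5 m) and the number of hoops are illustrative choices.

```python
import math


def disc_moment_of_inertia(mass, radius, hoops=10_000):
    """Approximate the m.i. of a uniform disc by summing thin concentric hoops."""
    sigma = mass / (math.pi * radius ** 2)      # surface density
    dr = radius / hoops                         # thickness of each hoop
    total = 0.0
    for i in range(hoops):
        r = (i + 0.5) * dr                      # mid-radius of the hoop
        hoop_mass = 2 * math.pi * sigma * r * dr
        total += hoop_mass * r * r              # a hoop of radius r has m.i. m r^2
    return total


approx = disc_moment_of_inertia(mass=200.0, radius=0.5)
exact = 0.5 * 200.0 * 0.5 ** 2                  # (1/2) M a^2 = 25 kg m^2
print(approx, exact)
```

The two values should agree more and more closely as the number of hoops is increased.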


Uniform rod 


Suppose that a uniform rod has mass M and length 2a. Its moment of 
inertia clearly depends on the chosen axis, and we shall consider two cases, 
both of them perpendicular to the rod and through its end and mid-point 
respectively. 




(a) About perpendicular axis through an end 


Let the linear density of the rod be ρ. 
Then M = 2ρa. 
Taking short elements of length δx and mass ρ δx, we see that 

m.i. $= \lim \sum \rho\,\delta x \times x^2$

$= \int_0^{2a} \rho x^2\,dx = \left[\tfrac13\rho x^3\right]_0^{2a} = \dfrac{8\rho a^3}{3}.$

But M = 2ρa. 

So m.i. $= \tfrac43 Ma^2.$


(b) About perpendicular axis through its mid-point 


In a similar way, 

m.i. $= \lim \sum \rho\,\delta x \times x^2$

$= \int_{-a}^{a} \rho x^2\,dx = \dfrac{2\rho a^3}{3}.$

So m.i. $= \tfrac13 Ma^2.$

Radius of gyration 


Since $I = \sum m r^2$, 

we can also express it as $M\kappa^2$, where κ is a length, called the radius of 
gyration of the body about this axis. As $M = \sum m$, we see that 

$\kappa^2 = \dfrac{\sum m r^2}{\sum m}$

is precisely the mean of the squares of the distances r of the masses m from 
the given axis (their variance, reckoned about the axis), so that κ is the 
corresponding standard deviation (or ‘root mean square’). 
Thus, for a uniform circular disc about a perpendicular axis through its 
centre, 

$M\kappa^2 = \tfrac12 Ma^2 \Rightarrow \kappa = \dfrac{a}{\sqrt2} \approx 0.71a,$

and for a uniform rod of length 2a about a perpendicular axis through its 
centre, 

$M\kappa^2 = \tfrac13 Ma^2 \Rightarrow \kappa = \dfrac{a}{\sqrt3} \approx 0.58a.$


Two general theorems 


In the calculation of moments of inertia it is sometimes convenient to make 
use of a general theorem, and here we shall establish two: one for finding 
the moment of inertia about an axis in terms of the moment of inertia 
about a parallel axis through the centre of mass, and the other which is 
applicable only to a lamina. 
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Parallel axis theorem 

Suppose that the moment of inertia of a body about a certain axis is $I_O$ 
and about a parallel axis through the centre of mass is $I_G$. 

We shall suppose that a typical mass m of the body lies at a point P and 
that a section taken through P perpendicular to the axes (which in our 
figure are supposed to be perpendicular to the page) meets them in points O 
and G. 

Taking an origin at O, we let 
$\mathbf{OG} = \mathbf{c}$, $\mathbf{GP} = \mathbf{r}$, $\mathbf{OP} = \mathbf{R}$. 

Then $I_O = \sum m R^2 = \sum m\,\mathbf{R}\cdot\mathbf{R}$
$= \sum m(\mathbf{r} + \mathbf{c})^2$
$= \sum m r^2 + 2\sum m\,\mathbf{r}\cdot\mathbf{c} + \sum m c^2$
$= \sum m r^2 + 2\left(\sum m\mathbf{r}\right)\cdot\mathbf{c} + \left(\sum m\right)c^2.$

We now let the total mass of the body, $\sum m = M$. 

Also, since the centre of mass lies on the axis through G, 
$\sum m\mathbf{r} = \mathbf{0}.$
So $I_O = \sum m r^2 + 2\mathbf{c}\cdot\mathbf{0} + \left(\sum m\right)c^2$

$\Rightarrow I_O = I_G + Mc^2.$

So, for example, we could have found the moment of inertia of a uniform 
rod about its end from that about its centre, simply by writing 

$I_{\text{end}} = \tfrac13 Ma^2 + Ma^2 = \tfrac43 Ma^2.$

And the moment of inertia of a uniform hoop of mass M and radius a about 
a perpendicular axis through a point of its circumference 

$= I_G + Ma^2$
$= Ma^2 + Ma^2 = 2Ma^2.$
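
The parallel axis theorem can also be verified numerically for an arbitrary set of particles. In the sketch below the fifty particles, their masses and the position of the axis O are all randomly chosen, hypothetical data; both axes are taken perpendicular to the plane containing the particles.

```python
import random


def inertia_about(points, ox, oy):
    """Moment of inertia of (m, x, y) particles about a perpendicular axis through (ox, oy)."""
    return sum(m * ((x - ox) ** 2 + (y - oy) ** 2) for m, x, y in points)


random.seed(0)  # fixed seed so that the same hypothetical body is produced each time
points = [(random.uniform(0.5, 2.0), random.uniform(-1, 1), random.uniform(-1, 1))
          for _ in range(50)]

M = sum(m for m, _, _ in points)
gx = sum(m * x for m, x, _ in points) / M      # centre of mass G
gy = sum(m * y for m, _, y in points) / M

ox, oy = 3.0, -2.0                             # hypothetical position of the axis O
c_squared = (gx - ox) ** 2 + (gy - oy) ** 2    # square of the distance OG

I_O = inertia_about(points, ox, oy)
I_G = inertia_about(points, gx, gy)
print(I_O, I_G + M * c_squared)
```

The two printed numbers should agree to within rounding error, whatever body and whatever axis are chosen.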


‘Perpendicular axes’ theorem for a lamina 


Suppose that perpendicular axes Ox, Oy, Oz are taken with Ox, Oy in the 
plane of a lamina and Oz perpendicular to its plane. Denoting moments of 
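inertia about these axes by $I_x$, $I_y$, $I_z$, and taking a typical mass m at 
the point (x, y) of the lamina, 

$I_x = \sum m y^2, \quad I_y = \sum m x^2, \quad I_z = \sum m(x^2 + y^2)$

$\Rightarrow I_z = I_x + I_y.$

Hence, for example, we can immediately find the moment of inertia I of a 
uniform disc about a diameter simply by observing that, since $I_x = I_y = I$ 
and $I_z = \tfrac12 Ma^2$, 
$I + I = \tfrac12 Ma^2$

$\Rightarrow I = \tfrac14 Ma^2.$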


Similarly, we know that a uniform rectangular lamina of mass M and 
sides 2a, 2b can be dissected into strips parallel to its sides, so that its 
moments of inertia about parallel axes through its centre are 


$\tfrac13 Ma^2$ and $\tfrac13 Mb^2$. 
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Hence, using the perpendicular axes theorem, we see that its moment of 
inertia about a perpendicular axis through its centre is 


$\tfrac13 Ma^2 + \tfrac13 Mb^2$
$= \tfrac13 M(a^2 + b^2).$


Exercise 11.11b 


Calculate the moments of inertia of: 


1 A uniform rectangular lamina of mass M and sides 2a, 2b about a per- 
pendicular axis through a corner. 


2 A uniform circular hoop of mass M and radius a about: 

(i) a diameter; 

(ii) a tangent; 

(iii) a perpendicular axis through a point on its circumference. 


3 A uniform circular lamina of mass M and radius a about: 
(i) a tangent; 
(ii) a perpendicular axis through a point on its circumference. 


4 A uniform sphere of mass M and radius a about: 
(i) a diameter; 
(ii) a tangent. 


5 A uniform spherical shell of mass M and radius a about: 
(i) a diameter; 
(ii) a parallel line through a point of the shell. 


(Hint: Σ m(y² + z²) = Σ m(z² + x²) = Σ m(x² + y²) 
and x² + y² + z² = a².) 


6 A uniform solid cone of mass M, height l and base-radius a, about: 
(i) its axis of symmetry; 

(ii) a perpendicular axis through its vertex; 

(iii) a diameter of its base. 


Dynamics of rotating bodies 


Suppose that a body is rotating with angular velocity ω about a fixed axis, 
that the moment of the external forces about this axis has magnitude L and 
the moment of inertia of the body about this axis is I. 

Further, suppose that on a mass m at a distance r from the axis there are 
acting transversely: 
(i) an external force P; and 
(ii) an internal force Q. 

Then $P + Q = mr\dot\omega$
$\Rightarrow r(P + Q) = mr^2\dot\omega$
$\Rightarrow \sum rP + \sum rQ = \left(\sum mr^2\right)\dot\omega = I\dot\omega.$
But since the forces Q are internal, it follows from Newton’s third law that 
$\sum rQ = 0.$
Hence $\sum rP = I\dot\omega.$
But $\sum rP = L.$
So $L = I\dot\omega.$
Now for a small rotation δθ the work done by P is Pr δθ. So in a finite 
displacement, the work done by P is $\int Pr\,d\theta$; and the sum of these amounts 
of work for all the external forces is 

$\sum \int Pr\,d\theta = \int \left(\sum Pr\right)d\theta = \int L\,d\theta.$

But $L = I\dfrac{d\omega}{dt} = I\dfrac{d\theta}{dt}\dfrac{d\omega}{d\theta} = I\omega\dfrac{d\omega}{d\theta}.$

So $\int L\,d\theta = \int I\omega\dfrac{d\omega}{d\theta}\,d\theta = \int I\omega\,d\omega$

$\Rightarrow \int_{\theta_1}^{\theta_2} L\,d\theta = \left[\tfrac12 I\omega^2\right]_{\omega_1}^{\omega_2}.$

But $\tfrac12 I\omega^2$ = k.e., 

and again we see that the total work done by the external forces is equal to 
the gain in kinetic energy. 


Example 1 


A uniform flywheel of mass 200 kg and radius 0.5 m is initially at rest and 
is made to rotate about its fixed axis by a rope which is wrapped round the 
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flywheel for three complete turns and then pulled with a steady force of 
200 N. Find: 

(i) the angular acceleration of the flywheel; 

(ii) its final angular velocity. 


(i) As the flywheel is uniform, 

m.i. $= \tfrac12 \times 200 \times (0.5)^2 = 25$ kg m². 

But the moment about its axis of the tension in the rope 
= 200 N × 0.5 m = 100 N m. 

So the angular acceleration $\dot\omega$ is given by 

$100 = 25\dot\omega \Rightarrow \dot\omega = 4.$

Hence the angular acceleration is 4 rad s⁻². 


(ii) As this angular acceleration is constant throughout three complete 
revolutions (= 6π rad), the final angular velocity ω is such that 

$\omega^2 = 0^2 + 2 \times 4 \times 6\pi = 48\pi$

$\Rightarrow \omega \approx 12.3.$
Hence the final angular velocity is 12.3 rad s⁻¹. 
Alternatively, the total work done by the tension 

= 100 N m × 6π rad = 600π J 

and final k.e. $= \tfrac12 \times 25 \times \omega^2$ J. 
So $\tfrac12 \times 25\omega^2 = 600\pi$

$\Rightarrow \omega^2 = 48\pi$

$\Rightarrow \omega \approx 12.3.$
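
The arithmetic of this example is easily reproduced by machine. The sketch below uses the data of the example (a uniform flywheel of mass 200 kg and radius 0.5 m, a pull of 200 N, three turns of rope); the variable names are illustrative only. It finds the final angular velocity by both methods, from the constant angular acceleration and from the work done.

```python
import math

mass, radius = 200.0, 0.5          # kg, m
force, turns = 200.0, 3            # N, complete turns of rope

inertia = 0.5 * mass * radius ** 2 # uniform disc: (1/2) M a^2
moment = force * radius            # moment of the tension about the axis
ang_acc = moment / inertia         # from L = I * (rate of change of omega)

theta = 2 * math.pi * turns        # total angle turned through, rad
omega_kinematic = math.sqrt(2 * ang_acc * theta)       # omega^2 = 2 * acc * theta
omega_energy = math.sqrt(2 * moment * theta / inertia) # work done = (1/2) I omega^2

print(inertia, ang_acc, round(omega_kinematic, 1))  # 25.0 4.0 12.3
```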

Example 2 


A flywheel whose moment of inertia is 1.01 kg m² is rotating at approximately 
1200 rev min⁻¹, the value oscillating between 1.0 per cent above 
and 1.0 per cent below this figure. Calculate in joules the difference between 
the maximum and minimum values of the kinetic energy. If the 
change from minimum to maximum occurs in one-hundredth of a second 
calculate the average power input in kilowatts during this period. 

Power to the flywheel is cut off at 1200 rev min⁻¹ and the wheel is brought 
to rest by a constant couple of 80 N m: calculate how many revolutions 
the wheel makes in coming to rest. (M.E.I.) 


Maximum angular velocity = 1212 rev min⁻¹ 

$= \dfrac{1212 \times 2\pi}{60}$ rad s⁻¹ 

= 40.4π rad s⁻¹. 

Minimum angular velocity = 1188 rev min⁻¹ 

= 39.6π rad s⁻¹. 
So difference of kinetic energy 

$= \tfrac12 \times 1.01 \times \{(40.4\pi)^2 - (39.6\pi)^2\}$
$= \tfrac12 \times 1.01 \times 80\pi \times 0.8\pi$
$= 0.505 \times 64\pi^2 \approx 319$ J. 

Hence average power input in 0.01 s 

$= \dfrac{319}{0.01}$ W = 31 900 W = 31.9 kW. 

When power is cut, 
k.e. $= \tfrac12 \times 1.01 \times (40\pi)^2 = 808\pi^2$ J. 

Now if the angle through which the flywheel turns before coming to rest 
is α, the work done by the braking couple is −80α N m. 

But work done = gain in k.e. 

So $-80\alpha = -808\pi^2$
$\Rightarrow \alpha = 10.1\pi^2$ rad ≈ 15.9 rev. 
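
The figures of this example can be confirmed with a few lines of code. The sketch below takes its data from the question (moment of inertia 1.01 kg m², 1200 rev min⁻¹ varying by 1 per cent either way, a braking couple of 80 N m); the helper `rad_per_s` and the variable names are illustrative.

```python
import math

INERTIA = 1.01                       # kg m^2
COUPLE = 80.0                        # braking couple, N m


def rad_per_s(rev_per_min):
    """Convert revolutions per minute into radians per second."""
    return rev_per_min * 2 * math.pi / 60


w_max = rad_per_s(1200 * 1.01)       # 1 per cent above the nominal speed
w_min = rad_per_s(1200 * 0.99)       # 1 per cent below the nominal speed
w_nom = rad_per_s(1200)

delta_ke = 0.5 * INERTIA * (w_max ** 2 - w_min ** 2)   # J
power_kw = delta_ke / 0.01 / 1000                      # average power over 0.01 s

ke_nominal = 0.5 * INERTIA * w_nom ** 2                # k.e. when power is cut
angle = ke_nominal / COUPLE                            # work done by couple = loss of k.e.
revolutions = angle / (2 * math.pi)

print(round(delta_ke), round(power_kw, 1), round(revolutions, 1))  # 319 31.9 15.9
```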


Exercise 11.11c 


1 Calculate the angular momentum and the kinetic energy of: 

(i) a uniform hoop of mass 8 kg and radius 3 m which is spinning in its 
own plane about its centre with angular velocity 6 rad s⁻¹; 

(ii) a uniform disc of mass 6 kg and radius 20 cm which is spinning in its 
own plane about its centre with angular velocity 4 rad s⁻¹; 

(iii) the same disc rotating at the same speed, but about a diameter; 

(iv) a uniform bar of mass 3 kg and length 4 m rotating about one of its 
ends with angular velocity 6 rad s⁻¹; 

(v) a hollow cylinder of mass 100 kg and radius 40 cm, rotating about its 
axis at 100 revolutions/min; 

(vi) the earth, assuming it to be a uniform sphere of mass 6 × 10²⁴ kg 
and radius 6.4 × 10⁶ m. 


2 A uniform flywheel has mass m and radius a, and is rotating freely 
about its axis with angular velocity ω. If a constant braking force F is then 
applied tangentially to its surface, find: 

(i) its angular deceleration; 

(ii) the time it takes to stop; 

(iii) the angle through which it turns. 


3 A bicycle is held with one of its wheels rotating freely with angular 
velocity 200 rev/min. The wheel has mass 2 kg and radius 25 cm, and 
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when the brakes are lightly applied it is brought to rest in three revolutions. 
Assuming that the mass of the wheel and the constant braking force are 
both concentrated at its circumference, find the magnitude of this force. 
For how long would it need to act; and if μ = 0.1, what is the total contact 
force between the brakes and the rim? 


4 A gyroscope consists of a uniform disc of mass 2 kg and diameter 
10 cm mounted on a light axle of diameter 4 mm. A string is wrapped 
round the axle 20 times and pulled sharply with a constant force. If this 
pull takes 2 s to unwrap the string, find: 

(i) the angular acceleration of the disc; 

(ii) its final angular velocity; 

(iii) the tension in the string. 


5 A rope 2 m long is wound round the axle of a flywheel and pulled with 
a constant force of 400 N. When the string is unwound the flywheel is 
rotating at 200 rev/min. Find its moment of inertia. 


6 A uniform rod of length 2a and mass M is freely pivoted at one end and 
is initially in a vertical position above its pivot. If it then topples over, find 
its loss of potential energy by the time it is (i) horizontal, (ii) vertically 
below the pivot; and so find its angular velocity at these two positions. 
Finally, by considering the moments of its weight, find the corresponding 
angular accelerations. 


7 A uniform cylindrical flywheel of mass 1000 kg and radius 0.4 m is 
being accelerated by an electric motor which is working at a constant rate 
of 10 kW. Ignoring all resistances, find the angular velocity of the fly- 
wheel after one minute. 


8 A heavy uniform rod of mass m and length l is suspended from one end 
and oscillates freely about its position of equilibrium. Find: 

(i) the moment of its weight about the point of suspension when the rod 
makes an angle θ with the downward vertical; 

(ii) the equation of motion when the rod is in this position; 

(iii) an approximate equation of motion when θ is small; 

(iv) the period of small oscillations. 


9 A light thread is wrapped round a uniform cylinder of mass m and 
radius a which is free to rotate about a horizontal axis. A mass M is tied to 
the end of this string and allowed to fall from rest in a vertical line. After it 
has fallen a distance x, find: 

(i) its speed v; 

(ii) its acceleration a. 


10 A uniform circular disc, of mass m and radius a, is free to rotate about a 
frictionless horizontal axis through its centre and perpendicular to its plane. 
One end of a light string is attached to a point on the circumference of the 
disc and part of the string is wound on the circumference. The other end of 
the string carries a particle A of mass 2m hanging freely. The system is 
released from rest and a restoring couple of moment mgaθ acts on the disc 
when θ is the angular displacement of the disc. Write down the equation 
of motion of the disc about its centre and the equation of motion of A. 
Hence show that the tension in the vertical part of the string is ⅖(1 + 2θ)mg. 
Show also that 

$5a\dfrac{d^2\theta}{dt^2} = 4g - 2g\theta.$

Deduce that the motion of A is simple harmonic and find the period of 
this motion. 

(You may assume that part of the string always remains wound on the 
disc during the motion.) (O.C.) 


11.12 Dimensions 


It will have been noticed that this introduction to mechanics has been 
based entirely upon the three fundamental concepts, or dimensions, of mass, 
length and time. If we now use the symbol [ ] to indicate the dimensions 
of a quantity, we can write 


[mass] = M, [length] = L, [time] = T. 


Furthermore, we can proceed to find the dimensions of other quantities in 
terms of M, L, T, such as 


[area] = [length × length] = L², 

[volume] = [area × length] = L³, 

[density] = [mass/volume] = M/L³ = ML⁻³, 

[velocity] = [length/time] = L/T = LT⁻¹, 

[acceleration] = [velocity/time] = LT⁻¹/T = LT⁻², 

[force] = [mass × acceleration] = MLT⁻², 

[moment of a force] = MLT⁻² × L = ML²T⁻², 
[impulse] = [force × time] = MLT⁻² × T = MLT⁻¹, 
[momentum] = [mass × velocity] = MLT⁻¹, 

[work] = [force × distance] = ML²T⁻², 

[kinetic energy] = [mass × (velocity)²] = ML²T⁻², 
[power] = [work/time] = ML²T⁻³. 

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It will be seen that the dimensions of impulse are identical (or consistent) 
with those of momentum, and that the dimensions of work are consistent 
with those of energy. This notion of consistency can be used to check, and 
sometimes to derive, equations in mechanics. 
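
Such checks of consistency are purely mechanical, and can be sketched in a few lines of code. In the sketch below each quantity is represented by its three exponents of M, L and T; the representation as a tuple and the helper names `times` and `over` are choices made here for illustration.

```python
# A dimension is stored as its exponents of (M, L, T).
MASS, LENGTH, TIME = (1, 0, 0), (0, 1, 0), (0, 0, 1)


def times(a, b):
    """Dimensions of a product: add the exponents."""
    return tuple(x + y for x, y in zip(a, b))


def over(a, b):
    """Dimensions of a quotient: subtract the exponents."""
    return tuple(x - y for x, y in zip(a, b))


velocity = over(LENGTH, TIME)
acceleration = over(velocity, TIME)
force = times(MASS, acceleration)
impulse = times(force, TIME)
momentum = times(MASS, velocity)
work = times(force, LENGTH)
kinetic_energy = times(MASS, times(velocity, velocity))
power = over(work, TIME)

print(impulse == momentum, work == kinetic_energy, power)  # True True (1, 2, -3)
```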


Example 1 


A piano wire has mass m, length l, and tension F, and a student knows that 
its period of vibration is 

either $2\pi\sqrt{\dfrac{ml}{F}}$ or $2\pi\sqrt{\dfrac{F}{ml}}$. 

Which is more likely to be correct? 
Since [m] = M, [l] = L, [F] = MLT⁻², it follows that the dimensions 
of these two possibilities are 

$\sqrt{\dfrac{ML}{MLT^{-2}}} = T$ and $\sqrt{\dfrac{MLT^{-2}}{ML}} = T^{-1}.$

Hence it is clear that the second possibility for a period of dimension T 
would not be consistent, so that the likelier (and in fact correct) formula is 

$2\pi\sqrt{ml/F}$. 
Example 2 


A simple pendulum consists of a bob of mass m attached to a string of 
length l and the pendulum is swinging at a point where the acceleration 
due to gravity is g. Investigate, by means of dimensions, the way in which 
the period T depends upon m, l, and g. 

Firstly, we suppose that the period T depends jointly upon m, l, and g in 
such a way that: 
$T \propto m^\alpha l^\beta g^\gamma$, where α, β, γ are constants. 
But $[m^\alpha l^\beta g^\gamma] = M^\alpha L^\beta (LT^{-2})^\gamma = M^\alpha L^{\beta+\gamma} T^{-2\gamma}$
and [T] = T. 
So for dimensional consistency, 
$M^\alpha L^{\beta+\gamma} T^{-2\gamma} = T$
$\Rightarrow \alpha = 0, \quad -2\gamma = 1, \quad \beta + \gamma = 0$
$\Rightarrow \alpha = 0, \quad \gamma = -\tfrac12, \quad \beta = \tfrac12.$

Hence $T \propto m^0 l^{1/2} g^{-1/2} = \sqrt{l/g}.$

It will be recognised that although this does not constitute a proof, it is 
nevertheless a highly convenient approach to such an investigation, which 
is confirmed by the results of p. 233 that $T = 2\pi\sqrt{l/g}$. 
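
The three conditions on the indices form a set of simultaneous linear equations, and so can be solved by machine. The sketch below sets up these equations for the pendulum and solves them exactly with fractions; the small elimination routine `solve` is written here for illustration and assumes a square, non-singular system.

```python
from fractions import Fraction


def solve(matrix, rhs):
    """Gauss-Jordan elimination with exact fractions (square, non-singular systems)."""
    n = len(rhs)
    rows = [[Fraction(v) for v in row] + [Fraction(b)]
            for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [row[-1] for row in rows]


# Columns: exponents of m, l, g.  Rows: total powers of M, L, T.
A = [[1, 0, 0],     # M:  alpha
     [0, 1, 1],     # L:  beta + gamma
     [0, 0, -2]]    # T:  -2 * gamma
target = [0, 0, 1]  # the period has dimensions M^0 L^0 T^1

alpha, beta, gamma = solve(A, target)
print(alpha, beta, gamma)  # 0 1/2 -1/2
```

The same routine serves for any problem in which three quantities are combined to give a fourth, provided that their dimensions are independent.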


Exercise 11.12 


1 State the dimensions, in terms of M, L, T, of the following quantities: 


(i) pressure (force per unit area); 
(ii) line density; 
(iii) surface density; 
(iv) frequency; 
(v) angular velocity; 
(vi) angular acceleration; 
(vii) angle; 
(viii) moment of inertia; 
(ix) rate of loss of mass (e.g., of a rocket); 
(x) constant of gravitation (see 8.11); 
(xi) stiffness of a spring (see 11.8); 
(xii) surface tension (energy per unit area). 


2 Use the method of dimensions to predict how: 


(i) The tension T in a string depends upon the mass m of a particle which 
is being whirled round on its end in a circle of radius r and with speed v. 

(ii) The height h reached by a stone will depend upon its mass m, the 
energy E with which it is projected vertically and the acceleration g due to 
gravity. 

(iii) The speed v of sound in a gas will depend upon its pressure p, its 
density ρ and the acceleration g due to gravity. 

(iv) The frequency f of oscillation of a light spring depends upon its 
stiffness, the mass m which it is supporting and the acceleration g due to 
gravity. 

(v) The velocity v of waves in a deep liquid depends upon its density ρ, 
their wavelength l and the acceleration g due to gravity. 


3 Use the method of dimensions to find an expression for the thrust of 
moving air on a fixed obstacle, assuming it to be of the form $T = kv^\alpha\rho^\beta A^\gamma$ 
where v is the air velocity, ρ its density, A the frontal area which the obstacle 
presents to it, and k is a non-dimensional constant. Explain briefly the 
nature of a dynamical argument which leads us when using absolute units 
of force to expect the experimental value of k always to be less than unity. 

Under certain conditions the compressibility c of the air can affect the 
thrust appreciably, c being defined as fractional change in volume produced 
per unit change in pressure. Find the dimensions of c and show that we can 
now obtain a dimensionally correct formula for T by replacing k by any 
polynomial in a quantity z defined as $v\sqrt{c\rho}$. (M.E.I.) 
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Miscellaneous problems 11 

1 Let O be the centre of the circumcircle of △ABC whose vertices have 
position vectors a, b and c. Show that the point H, with position vector 
h = a + b + c, is the orthocentre of △ABC (i.e. AH ⊥ BC, BH ⊥ CA, 
CH ⊥ AB) and hence show that the centroid G divides OH in the ratio 1:2. 


2 The sides BC, CA, AB of △ABC are divided in the same ratio by 
P, Q, R respectively. Investigate the centroid of △PQR. 


3 O is a point in the plane of a triangle ABC and AO, BO, CO meet 
BC, CA, AB respectively in points P, Q, R. Using bold letters to denote 
position vectors from O, prove 

(i) that it is possible to find scalars α, β, γ such that 

$\alpha\mathbf{a} + \beta\mathbf{b} + \gamma\mathbf{c} = \mathbf{0}$; 

(ii) that p can be expressed as 

$-\dfrac{\alpha\mathbf{a}}{\beta+\gamma} = \dfrac{\beta\mathbf{b} + \gamma\mathbf{c}}{\beta+\gamma}$; 

(iii) that P divides BC in the ratio γ/β; 

(iv) Ceva’s theorem, that 

$\dfrac{BP}{PC} \times \dfrac{CQ}{QA} \times \dfrac{AR}{RB} = 1.$

Finally, verify Ceva’s theorem in the three cases when O is the centroid, 
the orthocentre, the circumcentre, of △ABC. 


4 If a line cuts the sides BC, CA, AB (extended if necessary) of a triangle 
ABC in points P, Q, R respectively, prove Menelaus’ theorem, that 

$\dfrac{BP}{PC} \cdot \dfrac{CQ}{QA} \cdot \dfrac{AR}{RB} = -1.$


5 ABC, A′B′C′ are two skew lines and AB : BC = A′B′ : B′C′. Prove that 
the mid-points of AA′, BB′, CC′ are collinear. 


6 The vertices of a tetrahedron are A, B, C, D and the points P, Q, R, S 
divide the segments AB, BC, CD, DA in the ratios p:1, q:1, r:1, s:1 where 
p, q, r, s are all positive. Show that P, Q, R, S are coplanar if pqrs = 1. 
Show that, if also p = r and q = s, then PQRS is a parallelogram. 
(C.S.) 


7 The base of a solid hemisphere of radius a is firmly attached to one 
plane end, also of radius a, of a solid cylinder of length l, thus forming one 
solid of revolution. The material is the same and uniform throughout the 
composite body. Find the position of its centre of gravity. 

If the body can rest in equilibrium with any point of the surface of the 
hemispherical portion resting on a horizontal plane, show that a = l√2. 
(M.E.I.) 

8 Prove that if the sum of the squares of two opposite edges of a tetrahedron 
is equal to the sum of the squares of another pair of opposite edges, then 
the remaining pair of opposite edges are perpendicular. 


9 OA, OB, OC are three lines through the point O and the angles BOC, 
COA, AOB are α, β, γ respectively. Calculate cos² θ, where θ is the angle 
between the line OA and the plane OBC. (C.S.) 


10 A ball of mass m is thrown into the air with initial velocity u, and 
during its flight it experiences a resistance from the air given by the vector 
−mcv, where v is the velocity of the ball at that instant. Write down a 
differential equation for the velocity as a function of the time, and verify 
that this equation and the initial conditions are both satisfied if 

$\mathbf{v} = \dfrac{\mathbf{g}}{c} + \left(\mathbf{u} - \dfrac{\mathbf{g}}{c}\right)e^{-ct},$

where g is the vector acceleration due to gravity. 

Find an expression for r, the displacement of the ball from its initial 
position at time t; and show that, if c is small, then this is approximately 
equal to 

$\mathbf{r}^* - c\left(\tfrac16 t^3\mathbf{g} + \tfrac12 t^2\mathbf{u}\right),$

where r* is the position that it would occupy in the absence of air resistance. 
(M.E.I.) 


11 A particle of unit mass is attracted to a point O with a force −9r, 
where r is the position vector of the particle relative to O. Initially the 
particle is at a point with position vector 16i and moving with velocity 24j, 
where i and j are fixed unit vectors in perpendicular directions. Verify 
that the subsequent motion is described by the equation 

$\mathbf{r} = 16\mathbf{i}\cos 3t + 8\mathbf{j}\sin 3t,$

and interpret this equation geometrically. 

If there were also a resistance to motion of magnitude ten times the 
speed, obtain a differential equation for the motion and show that it has a 
solution in the form 

$\mathbf{r} = \mathbf{A}e^{-t} + \mathbf{B}e^{-9t}.$

With the same initial conditions as before, find the values of A and B, and 
investigate the nature of the motion for t > 0. (S.M.P.) 


12 A particle of mass m is projected with velocity V at an angle α to the 
horizontal. The air resistance is mk times the speed of the particle, where k is 
a constant, and opposes the direction of motion at all times. Prove that the 
equations of motion of the particle can be put in the form 

$\ddot{x} + k\dot{x} = 0$

$\ddot{y} + k\dot{y} + g = 0$
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where x and y are respectively the horizontal and vertical distances from 
the point of projection. 

Show that 


$x = \dfrac{V}{k}\cos\alpha\,(1 - e^{-kt})$


and find a similar expression for y. 
Sketch the path of the projectile. (M.E.I.) 


13 A steady uniform stream of air of density ρ and speed u strikes at right 
angles a plane surface of area A and proceeds, after the contact, with its 
effective speed still in the original direction and equal to ku (where k > 0). 
Derive an expression for the thrust of the air on the area, stating carefully 
the principles on which your derivation is based. 

A householder has a garage with rectangular doors each 1.20 m wide 
by 2.25 m high, turning about vertical hinges at their edges. On a windy 
day he attempts to keep one door open in such a position that the wind is 
blowing at right angles to the door. To do this he puts a brick on the ground 
at the outer corner of the door, hoping that the friction of the brick on the 
ground will be sufficient for the task. Given the data listed below, calculate 
the greatest wind-speed for which the brick will be effective: state any 
additional assumptions you find it necessary to make in the course of your 
work.

Mass of brick = 3.0 kg; coefficient of friction = 0.4; density of air =
1.10 kg m$^{-3}$; k = 0.6; g = 10 m s$^{-2}$. (M.E.I.)


14 A shell of mass M is at rest in space, when it bursts into two fragments,
the energy released being E. Show that the relative speed of the fragments
after separation cannot be less than $2\sqrt{2E/M}$.

Explain how your conclusion is affected if the shell is moving initially
with speed U. (C.S.)


15 A space craft of mass 107 kg is going round the moon in a circular
orbit at a height of 10° m. Find the time of the orbit, taking g on the moon
as 1.60 m s$^{-2}$ and the diameter of the moon as 4.14 x 10° m.

To escape from the moon the kinetic energy of the body has to be
doubled. What is the magnitude and direction of the least impulse which
can be applied to the body to enable it to escape?

If this impulse is spread over a period of one minute, what is the apparent 
weight of a 100 kg man inside the space craft during this time? (M.E.I.) 


16 A particle of mass m is attached to the middle point of a light elastic
string of natural length a and modulus mg. The ends of the string are
attached to two fixed points A and B, A being at a distance 2a vertically
above B. Prove that the particle can rest in equilibrium at a depth 5a/4
below A and find the period of oscillation if it is displaced slightly in the
vertical direction.


17 A mountaineer falls over a cliff. He is attached to a rope which,
providentially, stretches so that he just touches the ground at the foot of
the cliff. Find the height of the cliff and the time taken for the mountaineer
to reach the ground (in terms of his mass, the length of the unstretched rope
and its elastic modulus). (C.S.)


18 Three particles A, B and C each of mass m lie at rest in a straight line
on a smooth horizontal table, joined by equal taut strings AB, BC. A
horizontal impulse P is given to B in a direction perpendicular to the line of
the strings. Describe the nature of the motion up to the instant when A 
and C meet, and calculate the loss of energy of the system if these masses 
do not separate after colliding. (M.E.I.) 


19 A pump working effectively at P kW delivers water through a nozzle
of area A cm$^2$ at a speed of v m s$^{-1}$, raising the water h m in the process.
Obtain a formula for P in terms of A, v, h and g; assume that 1 m$^3$ of water
has a mass of 10$^3$ kg.

The jet is directed horizontally at right angles to a vertical blade of a
water wheel which is initially at rest. The wheel has a moment of inertia J
about its axis which is horizontal and the point of contact of the water is cm
from the axle; the water falls vertically after striking the blade.

What is the initial angular acceleration of the wheel? At what rate is
energy being given to the wheel initially? (M.E.I.)


20 A tank containing a liquid of density $\rho$ has a hole in the bottom. The
speed u at which the liquid escapes (averaged over the cross-section of the
hole) is given by $u = \lambda\rho V g$, where V is the volume of liquid in the tank and
$\lambda$ is a constant. Find the dimensions of $\lambda$ in terms of mass, length and time.

Given that the hole has cross-sectional area A, write down a differential
equation for V as a function of t, in terms of the various constants. (S.M.P.)


21 The velocity of propagation of waves of wavelength $\lambda$ on the surface
of a liquid is proportional to

$$\sqrt{\frac{\sigma}{\lambda\rho}},$$

where $\sigma$ is the surface tension and $\rho$ the density. Deduce the physical
dimensions of $\sigma$.

Lord Rayleigh showed that the period, T, of vibration of a small liquid
drop, when given a slight distortion, depended only on $\sigma$, $\rho$ and the radius r
of its spherical equilibrium shape. Assuming that

$$T = C\sigma^{\alpha}\rho^{\beta}r^{\gamma},$$

where C, $\alpha$, $\beta$ and $\gamma$ are dimensionless constants, find $\alpha$, $\beta$ and $\gamma$. (C.S.)


12 


Probability distributions and 
statistics 


12.1 Probability distributions and generators 


Take two coins, spin them as a pair ten times and record your results: two 
heads (HH), two tails (TT), or one of each (HT). When the author per- 
formed this experiment he obtained the sequence 


HT, HT, TT, HT, HH, HH, HT, HT, HH, HT, 


which can conveniently be classified according to the frequency $f_r$ with
which the number of heads obtained is $x_r$:


No. of heads ($x_r$)    Frequency ($f_r$)
$x_1 = 0$               $f_1 = 1$
$x_2 = 1$               $f_2 = 6$
$x_3 = 2$               $f_3 = 3$
                        $n = 10$


In chapter 7 we called this a frequency distribution, and saw that its
mean m is given by:

$$m = \frac{1}{n}\sum f_r x_r = \frac{1}{10}(1 \times 0 + 6 \times 1 + 3 \times 2) = 1.2$$

and its variance $s^2$ by

$$\begin{aligned}
s^2 &= \frac{1}{n}\sum f_r (x_r - m)^2 \\
    &= \frac{1}{n}\sum f_r x_r^2 - m^2 \\
    &= \tfrac{1}{10}(1 \times 0^2 + 6 \times 1^2 + 3 \times 2^2) - 1.2^2 \\
    &= 1.8 - 1.44 \\
    &= 0.36.
\end{aligned}$$
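The arithmetic above can be checked mechanically. The following is a minimal Python sketch, not part of the original text; the function name `mean_and_variance` is chosen here purely for illustration, and the data are the author's ten recorded spins.

```python
def mean_and_variance(values, freqs):
    """Return (m, s2) for a frequency distribution.

    values[i] occurs freqs[i] times; s2 uses the form (1/n) * sum(f x^2) - m^2.
    """
    n = sum(freqs)
    m = sum(f * x for x, f in zip(values, freqs)) / n
    s2 = sum(f * x * x for x, f in zip(values, freqs)) / n - m * m
    return m, s2

heads = [0, 1, 2]   # number of heads x_r
freqs = [1, 6, 3]   # observed frequencies f_r, so n = 10
m, s2 = mean_and_variance(heads, freqs)
print(round(m, 2), round(s2, 2))
```

Rounding is applied only when printing, since floating-point subtraction may leave a tiny residue in the last digit.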


If, instead of recording frequencies, we tabulate the relative
frequencies $g_r = f_r/n$, we obtain:


No. of heads ($x_r$)    Relative frequency ($g_r$)
$x_1 = 0$               $g_1 = \frac{1}{10}$
$x_2 = 1$               $g_2 = \frac{6}{10}$
$x_3 = 2$               $g_3 = \frac{3}{10}$


Furthermore, we can see that

$$m = \sum g_r x_r$$

and

$$s^2 = \sum g_r (x_r - m)^2 = \sum g_r x_r^2 - m^2.$$


So far we have made no assumptions about the coins and have not 
remarked upon any expectations which we might have had. Our task has 
been simply to record a set of events and to describe them as conveniently as 
possible. If, however, we now make assumptions about the coins and the
way in which they are spun, we can use the probability theory of chapter 7
to predict our expectations. Suppose, for instance, we assume that the coins
are both completely unbiased, and they are spun with absolute fairness;
then the probabilities $p_r$ of obtaining $x_r$ heads are clearly:


No. of heads ($x_r$)    Probability ($p_r$)
$x_1 = 0$               $p_1 = \frac{1}{4}$
$x_2 = 1$               $p_2 = \frac{1}{2}$
$x_3 = 2$               $p_3 = \frac{1}{4}$


This set of probabilities is referred to as a probability distribution, and the 
set of assumptions (which must be expressed in precise probability terms) 
from which it arises is called the probability model. 

If different assumptions were made, we would have a different probability
model, and so a different probability distribution. For instance, if
one of the coins was double-headed, and the other was fair and unbiased,
then the probability distribution would be as follows:


$x_r$        $p_r$
$x_1 = 0$    $p_1 = 0$
$x_2 = 1$    $p_2 = \frac{1}{2}$
$x_3 = 2$    $p_3 = \frac{1}{2}$


One of the major tasks of statistics is to compare an observed frequency
distribution with the probability distribution which arises from a particular
probability model, so equivalent measures to the mean and variance
of a frequency distribution are required for a probability distribution. To
maintain a clear distinction between the two distributions, such probability
measures are denoted by the Greek letters $\mu$, $\sigma^2$.

Now $m = \sum g_r x_r$ and $s^2 = \sum g_r (x_r - m)^2$,

so $\mu = \sum p_r x_r$ and $\sigma^2 = \sum p_r (x_r - \mu)^2$.


As in statistics, so with probability distributions, $\mu$ is called the mean,
or expected value of x, $\sigma^2$ its variance and $\sigma$ its standard deviation.


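So for the probability distribution with two unbiased coins,

$$\mu = \tfrac{1}{4} \times 0 + \tfrac{1}{2} \times 1 + \tfrac{1}{4} \times 2 = 1$$

$$\sigma^2 = \tfrac{1}{4} \times 1^2 + \tfrac{1}{2} \times 0^2 + \tfrac{1}{4} \times 1^2 = 0.5$$

(whereas in our trial, m = 1.2 and $s^2$ = 0.36).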
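The same calculation can be carried out exactly for any probability model. The sketch below is an illustration only, not the authors' method; it uses Python's `fractions` module so that no rounding occurs, and the names `expected_value_and_variance`, `fair` and `one_double_headed` are invented for this example.

```python
from fractions import Fraction as F

def expected_value_and_variance(dist):
    """dist maps each value x_r to its probability p_r; returns (mu, sigma^2)."""
    assert sum(dist.values()) == 1, "probabilities must sum to 1"
    mu = sum(p * x for x, p in dist.items())
    var = sum(p * (x - mu) ** 2 for x, p in dist.items())
    return mu, var

# Two unbiased coins.
fair = {0: F(1, 4), 1: F(1, 2), 2: F(1, 4)}
# One double-headed coin and one unbiased coin.
one_double_headed = {0: F(0), 1: F(1, 2), 2: F(1, 2)}

print(expected_value_and_variance(fair))
print(expected_value_and_variance(one_double_headed))
```

For the two unbiased coins this reproduces $\mu = 1$ and $\sigma^2 = \frac{1}{2}$; for the model with one double-headed coin the same formulae give $\mu = \frac{3}{2}$ and $\sigma^2 = \frac{1}{4}$.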


[It should be pointed out at this stage that in the calculation of $\mu$ and $\sigma$,
just as for m and s, the values of $x_r$ are not restricted to integers: see, for
instance, Exercise 12.1a, no. 8.]


In summary: 


Statistics                                   Probability
Relative frequency $g_r = f_r/n$             Probability $p_r$
Mean $m = \sum g_r x_r$                      Mean (expected value) $\mu = \sum p_r x_r$
Variance $s^2 = \sum g_r (x_r - m)^2$        Variance $\sigma^2 = \sum p_r (x_r - \mu)^2$
              $= \sum g_r x_r^2 - m^2$                         $= \sum p_r x_r^2 - \mu^2$


Exercise 12.1a 


1 Calculate the expected value and variance of the number of heads 
showing when three unbiased coins are tossed together. Conduct a series 
of 20 such trials and calculate the mean and variance for the frequency dis- 
tribution that you obtain. Compare your theoretical and experimental 
results. 


2 Calculate:

(i) the variance of the score obtained on throwing a single die;
(ii) the variance of the total score obtained on throwing two dice;
(iii) the variance of the average score on two dice.

How are your answers to (i) and (ii) and to (i) and (iii) related?
(Assume that the dice are ‘fair’.)


3 A computer is made to produce randomly the numbers 0, 1, 2, ..., 9.
What are the expected value and variance of numbers so produced?


4 On the basis of past evidence, it is estimated that the probabilities of a
certain type of plant having 4, 5, 6, 7, 8, 9 leaves are 0.13, 0.21, 0.38, 0.16,
0.09, 0.03 respectively. Find, correct to 2 decimal places, the expected
value and variance of the number of leaves on such a plant.


5 Two unbiased dice are rolled and the greater score (or either if they are
the same) is recorded. State the set of possible scores and the probabilities
associated with these scores. Find the expected value (i.e., theoretical
mean) of the recorded scores. (S.M.P.)


6 A boy spins a coin until he obtains a head, but impatiently gives up if 
he does not succeed in five attempts. What are the expected value and 
variance of the number of times he spins the coin? 


7 What are the mean and variance of the number of different factors
(other than 1 and the number itself) of an integer chosen at random in the
range 1 to 30?


8 A box contains 100 tokens which differ in mass, but are otherwise 
identical. 20 of the tokens have a mass of 4.8 g each, 35 a mass of 5.2 g, 
25 a mass of 3.7 g, 15 a mass of 6.5 g, and the remaining 5 have a mass of 
8.0 g each. Calculate (to 2 decimal places) the expected value and variance 
of the mass of a token chosen at random from the box. 


9 An examination question consists of two parts, A and B, and the
probability of a pupil getting part A correct is +. If he gets A correct, the 
probability of getting B correct is $; otherwise it is +. There are three marks 
for a correct solution to part A, two marks for part B, and a bonus mark if 
both parts are correct. Calculate the expected value and variance of the 
pupil’s total mark for the question. 


10 In a game where the gambler rolls two dice, on a £1 stake the casino
pays out £10 for a double six, and £3 for a score of seven (the stake money
being returned as well), and for any other score the gambler loses his
money. What is the casino’s expected profit (to the nearest penny) on a
£100 stake?


11 Two small piles of cards contain respectively the 1, 2, 3, 4, 5, 6 of
diamonds and the 7, 8, 9, 10 of diamonds. Verify that these sets of numbers
have expectations $3\frac{1}{2}$ and $8\frac{1}{2}$ respectively, and that the variances are $2\frac{11}{12}$
and $1\frac{1}{4}$ respectively.

One card is drawn at random from each of the piles and the product
of the numbers so formed is calculated. Find the expected value and
variance of this product.

Find also the probability that the product from any one draw will exceed
its expectation. (C.)


12 A motorist drives into town and has the choice of two car parks. It
takes him 20 minutes to drive from his home to car park A, which is never
full, and then it takes him 15 minutes to walk to the office. If he decides
to try to get to car park B it takes him 25 minutes to get there from his home.
The probability that car park B is not full is p, and in this case he can walk
to the office in 5 minutes. If car park B is full he drives back to car park A,
which takes 5 minutes. If he always drives first to car park B find the
probability distribution of the time T, in minutes, taken to get to work. Find the
expected value of T, showing that its value is less than 35 if p exceeds $\frac{2}{3}$.
Find also the variance of T.

If, however, the motorist tosses a coin in order to decide whether to go
straight to car park A or to try car park B first, find, in this case, the
probability distribution of T, and the expected value of T. (C.)


Probability generators 


Suppose that we want to find the mean and variance of the geometric 
probability distribution (see 7.4) of the number of throws required of a 
‘fair’ die until a six is obtained. 

The probability distribution is given by: 


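No. of throws    Probability
$x_1 = 1$        $p_1 = \frac{1}{6}$
$x_2 = 2$        $p_2 = \frac{5}{6} \times \frac{1}{6} = \frac{5}{36}$
$x_3 = 3$        $p_3 = \left(\frac{5}{6}\right)^2 \times \frac{1}{6} = \frac{25}{216}$
...
$x_r = r$        $p_r = \left(\frac{5}{6}\right)^{r-1} \times \frac{1}{6}$

Hence

$$\mu = \tfrac{1}{6} \times 1 + \tfrac{5}{36} \times 2 + \tfrac{25}{216} \times 3 + \cdots$$

and

$$\sigma^2 = \left(\tfrac{1}{6} \times 1^2 + \tfrac{5}{36} \times 2^2 + \tfrac{25}{216} \times 3^2 + \cdots\right) - \mu^2.$$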


Direct calculation of such infinite sums will be lengthy, to say the least,
and even where the number of outcomes is not infinite, calculation of $\mu$
and $\sigma^2$ can be very tedious. But, particularly when the variables $x_r$ are
integers, calculations can be eased considerably by using the probability
generator (or, as it is sometimes known, the probability generating function):

Let the outcomes which have associated random variables 0, 1, 2, ...
have probabilities denoted by $p_0, p_1, p_2, \ldots$. Then the probability
generator G(t) is defined by

$$G(t) = \sum p_r t^r = p_0 + p_1 t + p_2 t^2 + \cdots + p_n t^n + \cdots.$$

In the probability generator, t is a dummy variable: it has no significance
in itself, and any other letter would do just as well. We merely use the
powers of t to pick out equivalent probabilities, the coefficient of $t^n$ in
G(t) giving the probability of obtaining the value n for the random variable.
It should also be noted that G(t) may be a finite or infinite polynomial in t.

It is possible also to define a probability generator when the random
variables are negative or non-integral (see Exercise 12.1b, no. 13). In such
a case, of course, the generator will not be a polynomial, but nevertheless
the properties of G(t) which we will shortly derive will still hold. In practice,
however, such probability generators are not often used.

Clearly the probability generator provides us with a means of displaying
a complete probability distribution in a single expression. For instance, the
distribution for the number of heads obtained on throwing two unbiased
coins is summarised by the generator:

$$G(t) = \tfrac{1}{4} + \tfrac{1}{2}t + \tfrac{1}{4}t^2.$$

It is not immediately obvious how a probability generator can be used to
evaluate $\mu$ and $\sigma^2$, but we shall see shortly that they are calculated from the
first and second derivatives of G(t). Meanwhile, to aid differentiation, the
generator will need to be written in as simple a form as possible, so let us
simplify the generator of our geometric distribution:

$$\begin{aligned}
G(t) &= \tfrac{1}{6}t + \tfrac{5}{6} \times \tfrac{1}{6}t^2 + \left(\tfrac{5}{6}\right)^2 \times \tfrac{1}{6}t^3 + \left(\tfrac{5}{6}\right)^3 \times \tfrac{1}{6}t^4 + \cdots \\
     &= \frac{t}{6}\left[1 + \frac{5t}{6} + \left(\frac{5t}{6}\right)^2 + \left(\frac{5t}{6}\right)^3 + \cdots\right].
\end{aligned}$$

Now the contents of the square brackets can be recognised as a geometric
progression whose sum is

$$\lim_{n \to \infty} \frac{1 - (5t/6)^n}{1 - 5t/6},$$

and since t has no significance in itself, we can choose it such that $|5t/6| < 1$,
so that $(5t/6)^n \to 0$ as $n \to \infty$.

Hence

$$G(t) = \frac{t}{6} \times \frac{1}{1 - 5t/6} = \frac{t}{6 - 5t}.$$


Derivation of μ, σ΄ from the probability generator 


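When the random variables $x_1, x_2, \ldots$ are 0, 1, 2, ... the formulae for
expected value and variance take the form:

$$\mu = \sum r p_r, \qquad \sigma^2 = \sum r^2 p_r - \mu^2.$$

Now

$$\begin{aligned}
G(t) &= \sum p_r t^r \\
\Rightarrow\; G'(t) &= \sum r p_r t^{r-1} &&\Rightarrow\; G'(1) = \sum r p_r, \\
\Rightarrow\; G''(t) &= \sum r(r-1) p_r t^{r-2} &&\Rightarrow\; G''(1) = \sum r(r-1) p_r, \\
\Rightarrow\; G''(1) + G'(1) &= \sum r^2 p_r.
\end{aligned}$$

Hence $\mu = \sum r p_r = G'(1)$

and $\sigma^2 = \sum r^2 p_r - \mu^2 = G''(1) + G'(1) - [G'(1)]^2$.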
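When the generator is a finite polynomial, $G'(1)$ and $G''(1)$ are just weighted sums of its coefficients, so the two results can be evaluated directly. The following Python sketch illustrates this under that assumption; the function name `mean_and_variance_from_generator` and the list representation of G(t) are choices made here, not notation from the text.

```python
from fractions import Fraction as F

def mean_and_variance_from_generator(coeffs):
    """coeffs[r] is p_r, the coefficient of t**r in G(t).

    Returns (mu, sigma^2) using mu = G'(1) and
    sigma^2 = G''(1) + G'(1) - [G'(1)]^2.
    """
    g1 = sum(r * p for r, p in enumerate(coeffs))            # G'(1)  = sum r p_r
    g2 = sum(r * (r - 1) * p for r, p in enumerate(coeffs))  # G''(1) = sum r(r-1) p_r
    return g1, g2 + g1 - g1 ** 2

# Generator for the number of heads with two unbiased coins: 1/4 + t/2 + t^2/4.
two_coins = [F(1, 4), F(1, 2), F(1, 4)]
mu, var = mean_and_variance_from_generator(two_coins)
print(mu, var)
```

This agrees with the values $\mu = 1$ and $\sigma^2 = 0.5$ found earlier for two unbiased coins by direct summation.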


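Summarising:

$$\mu = G'(1)$$

$$\sigma^2 = G''(1) + G'(1) - [G'(1)]^2$$

Returning now to our example,

$$\begin{aligned}
G(t) &= \frac{t}{6 - 5t} \\
\Rightarrow\; G'(t) &= \frac{(6 - 5t) + 5t}{(6 - 5t)^2} = \frac{6}{(6 - 5t)^2} &&\Rightarrow\; G'(1) = 6 \\
\Rightarrow\; G''(t) &= \frac{60}{(6 - 5t)^3} &&\Rightarrow\; G''(1) = 60 \\
\Rightarrow\; \mu &= 6 \quad\text{and}\quad \sigma^2 = 60 + 6 - 6^2 = 30.
\end{aligned}$$

Hence the expected number of throws required to obtain a six is 6, and
the standard deviation of the number of throws is $\sqrt{30}$.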
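As a check on the generator method, the infinite sums for $\mu$ and $\sigma^2$ can be approximated by adding a large number of terms. This is a rough numerical sketch in Python rather than anything from the text: the function name `geometric_moments` and the cut-off of 2000 terms are arbitrary choices, made on the assumption that the neglected tail is negligible because $(5/6)^r$ shrinks so quickly.

```python
def geometric_moments(p, terms=2000):
    """Partial sums for the geometric distribution p_r = (1 - p)**(r - 1) * p.

    Returns approximations to mu = sum r p_r and sigma^2 = sum r^2 p_r - mu^2.
    """
    q = 1.0 - p
    mean = 0.0
    second = 0.0
    prob = p  # p_1, the probability of success on the first throw
    for r in range(1, terms + 1):
        mean += r * prob
        second += r * r * prob
        prob *= q  # step from p_r to p_(r+1)
    return mean, second - mean ** 2

mu, var = geometric_moments(1 / 6)
print(round(mu, 6), round(var, 6))
```

The partial sums settle very close to 6 and 30, in agreement with the values obtained from $G'(1)$ and $G''(1)$.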


Binomial distribution 


We saw in 7.4 that if the chance of a particular event occurring in a single
trial is p, then in n such independent trials, the probabilities of 0, 1, 2, 3, ..., n
occurrences are given by the terms of the binomial expansion

$$(q + p)^n, \quad\text{where } q = 1 - p.$$


These probabilities, therefore, are also the coefficients of successive 
powers of ¢ in the expansion of 


$$G(t) = (q + pt)^n$$

which is therefore the probability generator of the binomial distribution.

Now

$$\begin{aligned}
G(t) &= (q + pt)^n \\
\Rightarrow\; G'(t) &= np(q + pt)^{n-1} \\
\Rightarrow\; \mu &= G'(1) = np(q + p)^{n-1} = np, \quad\text{since } q + p = 1.
\end{aligned}$$

Furthermore

$$\begin{aligned}
G''(t) &= n(n-1)p^2(q + pt)^{n-2} \\
\Rightarrow\; G''(1) &= n(n-1)p^2 \\
\Rightarrow\; \sigma^2 &= G''(1) + G'(1) - [G'(1)]^2 \\
&= n^2p^2 - np^2 + np - n^2p^2 \\
&= np(1 - p) \\
&= npq.
\end{aligned}$$

Hence $\mu = np$, $\sigma = \sqrt{npq}$.

The result that, for a binomial distribution,

$$G(t) = (q + pt)^n = (q + pt)(q + pt)\cdots(q + pt),$$

where q + pt is the generator of a single trial, illustrates the fact that for a
probability distribution arising from a series of independent (and not
necessarily identical) trials, the overall probability generator is the product of
the generators of each of the separate trials.
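The product rule for generators can be tried out by multiplying polynomials term by term. The Python sketch below is illustrative only: generators are stored as coefficient lists (index = power of t), the helper names `multiply` and `binomial_generator` are invented here, and the values n = 12, p = 1/3 are simply sample inputs.

```python
from fractions import Fraction as F

def multiply(g, h):
    """Multiply two generators given as coefficient lists (index = power of t)."""
    out = [F(0)] * (len(g) + len(h) - 1)
    for i, a in enumerate(g):
        for j, b in enumerate(h):
            out[i + j] += a * b
    return out

def binomial_generator(n, p):
    """Coefficients of (q + p t)^n, built as a product of n single-trial generators."""
    single = [1 - p, p]  # q + p t
    g = [F(1)]           # generator of "no trials": the constant 1
    for _ in range(n):
        g = multiply(g, single)
    return g

coeffs = binomial_generator(12, F(1, 3))
mu = sum(r * c for r, c in enumerate(coeffs))
var = sum(r * r * c for r, c in enumerate(coeffs)) - mu ** 2
print(mu, var)
```

The mean and variance computed from the multiplied-out coefficients equal np and npq. Because `multiply` accepts any two coefficient lists, the same routine also handles trials that are independent but not identical.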


Example 


In a certain large city, it is known that $\frac{1}{3}$ of the voters support the radical
party. In a sample of 12 voters, what are the expected value and standard
deviation of the number of radicals?

We have therefore a binomial probability situation with n = 12, $p = \frac{1}{3}$
and $q = \frac{2}{3}$.

So $\mu = np = 12 \times \frac{1}{3} = 4$

and $\sigma = \sqrt{npq} = \sqrt{12 \times \frac{1}{3} \times \frac{2}{3}} = \sqrt{\frac{8}{3}} \approx 1.63.$


Exercise 12.1b 


1 Calculate the probability generator for the number of tosses of an 
unbiased coin needed to obtain the first head, and hence evaluate the 
mean and variance of the number of tosses. 


2 Calculate the expected value and standard deviation of the number of
heads obtained when an unbiased coin is tossed (i) 4, (ii) 36, (iii) 100 times.


3 In the next general election suppose that 36% of the electors intend to
vote Liberal. Find, in terms of n, the mean and standard deviation of the
percentage of those who intend to vote Liberal in samples of size n.
(S.M.P.)


4 Six players take it in turns to cut a pack of cards (excluding jokers), and 
each time the cards are replaced before the next player cuts. Calculate the 
expected value and standard deviation of the total numbers of spades 
turned up when each player has cut. 


5 The probability of a pupil arriving at school late on any given day is τ. 
What is the probability of his being punctual for a whole week (i.e., 
5 school days) ? Calculate the mean and variance of the number of days he 
will be late in a school term consisting of 14 weeks (i.e., 70 days). Also cal- 
culate the expected number of completely punctual weeks in the term. 


6 Seedlings are planted in 10 rows of six each. The probability of a seedling
dying before it flowers is ¢. Calculate the mean and variance of the number 
of rows in which all the seedlings flower. 


7 Eight per cent of eggs sold by a certain grocer are brown.
(i) Calculate the mean and variance of the number of brown eggs obtained
in a carton of 12.
(ii) By finding the probability of a carton containing no brown eggs, and
using a suitable probability generator, find how many cartons a housewife
can expect to buy before she finds her first brown egg.


8 Prove that for a binomial probability distribution arising from n trials,
the maximum possible value of the variance is n/4.


9 An electronics firm packs the resistors that it produces in boxes of 800.
On average one component in 100 is faulty. Calculate the expected number 
of defective resistors in a box, and the variance of this number. How could 
you quickly obtain a good approximation to the variance in this case? 
What would the percentage error (of the actual value) be if this approxima- 
tion were used? 


10 A box contains a large number of screws. The screws are very similar 
in appearance, but are in fact of 3 different types, A, B and C, which are 
present in equal numbers. For a given job only screws of type A are suitable. 
If 4 screws are chosen at random, find the probability that:

(i) exactly two are suitable;

(ii) at least two are suitable.

If 20 screws are chosen at random, find the expected value and variance of
the number of suitable screws. (C.)


11 In a machine game of chance, when a lever is pulled, one of the 
numbers 1, 2, 3 appears in a window. The lever is pulled 5 times and the 
total score is recorded. If the probabilities associated with the numbers 
1, 2,3 are4, 4, 4 respectively, write down an expression for the probability 
generator G(t) for the possible total scores. Evaluate G(—1) and, hence or 
otherwise, find the probability that the total score is even. (S.M.P.)


12 By using the fact that the probability generator for the number of
throws of a die required to obtain a six is $t/(6 - 5t)$, or otherwise, obtain
the probability generator for the number of throws required to obtain two
sixes (not necessarily consecutively). Hence calculate the expected value
and variance of the number of throws required to obtain two sixes.


13 A multiple-choice examination paper consists of 25 questions, each
with 5 possible answers, only one of which is correct. A student gains 4 marks
for a correct answer, and loses one mark for an incorrect answer.

(i) An entirely ignorant student has to guess the answer for each question.
Find the probability generator for the number of marks he obtains on the
paper, and hence calculate his expected total mark, and the standard
deviation of that mark. (Hint: first find the probability generator for the
marks on a single question.)

(ii) A rather more intelligent student has twice as much chance of choosing
the correct answers as he has of choosing any one of the wrong answers;
find the expected value and standard deviation of his total score on the
paper.


14 A petrol company gives away one medal with every purchase of
petrol. Each medal is equally likely to be any one of a set of n medals. At a
stage when a customer already has s different medals, show that the
expected number of purchases to get the first medal different from those
already held is n/(n − s). Deduce that the expected number of purchases
for a complete set is

$$n\left(1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{n}\right). \qquad \text{(C.)}$$


12.2 Continuous probability distributions 


Statement 1: The probability of throwing a 5 with a fair die is $\frac{1}{6}$.


Statement 2: The probability of a voter, chosen at random in London, being 
a Liberal is <. 

Statement 3: The probability of the new machine turning out a component
of length 241 mm is $\frac{1}{6}$.


Superficially the three statements above seem very similar. They all 
make predictions about what may occur if we conduct certain trials, and 
we presume that there is some foundation, theoretical or experimental, for 
each. But let us examine the statements separately, and in a little more 
detail. 


Statement 1 is presumably based on theoretical considerations (themselves
supported by past experience) and uses the notion of ‘equal likelihood’.
It tells us that if we conduct a large number of trials, about $\frac{1}{6}$ of the outcomes
will be 5’s; and that, in general, the larger the number of trials, the closer
we get to this fraction.


Statement 2 can only be based on statistical evidence, and tells us that in a 
large, randomly chosen group of voters, we could expect about + to be 
Liberals. 

Both of these statements are concerned with discrete (i.e., definite and 
distinct) outcomes: when a die is thrown there are only six distinct results 
possible, and similarly a voter has political views which can be expressed 
in certain distinct choices (and we would regard ‘don’t know’ as a category 
of choice). 


Statement 3, however, is concerned with length, a continuous quantity, and
presumably tells us that if, for instance, we look at 150 components produced
by the machine, about 25 will be 241 mm in length. But if we take
these 150 components, and measure each by a micrometer to 0.001 mm,
then we would certainly not expect 25 of them to be exactly 241 mm in
length, and the statement in its present form is meaningless. Perhaps it is
more likely to mean that if measurements are taken to the nearest millimetre,
then the probability of a length of 241 mm is $\frac{1}{6}$?

So although we can never be absolutely certain about the exact length of
any object, and must say that

$$p(\text{length} = 241 \text{ mm}) = 0,$$

nevertheless our original statement can meaningfully be written as:

$$p(240.5 < \text{length} < 241.5) = \tfrac{1}{6}.$$


Indeed for any continuous quantity, we cannot talk (as we did for 
discrete quantities) about the probability of it attaining a specific value. 
We can only meaningfully talk about the probability of length, time, 
etc., lying within a certain range of values. 


Cumulative probability 


Even though, for a continuous variable like length, we cannot use 
p(length = 241 mm), it would clearly be valuable if we could associate 
some form of probability with any given length. This is done by use of the 
notion of cumulative probability. 

The cumulative probability of length 241 is defined to be the probability
that the length is less than (or equal to) 241, and is written

$$\Phi(241) = p(\text{length} \le 241).$$

More generally, the cumulative probability function† for any variable x is
defined by

$$\Phi(X) = p(x \le X),$$

so that $0 \le \Phi(X) \le 1$ for all X.

(It should be noted that we can also meaningfully talk about the cumulative
probability of a discrete variable; and for instance if we throw a single die,
$\Phi(5) = \frac{5}{6}$. But clearly this notion is less useful, and is less frequently used.)


Example 1 


In a race, the times of the competitors are measured by stop-watch, to the
nearest $\frac{1}{10}$ of a second. What is the cumulative probability function for
error in measured time?
Let the error be denoted by x, which is uniformly distributed over the
interval −0.05 to 0.05.

Therefore $\Phi(-0.05) = 0$ and $\Phi(X) = 0$ for any $X < -0.05$,
          $\Phi(0.05) = 1$ and $\Phi(X) = 1$ for any $X \ge 0.05$,

and $\Phi(X)$ increases steadily in value from 0 to 1 over the interval
$-0.05 \le X \le 0.05$.

So $\Phi(X) = 10(X + 0.05)$, if $-0.05 \le X \le 0.05$.

† Also called a distribution function.


Example 2 


Particles are emitted uniformly from a radioactive source in the corner of a
room. A screen (of theoretically infinite length) is positioned one metre
away from the source, so that all particles must strike the screen. What is the
cumulative probability function of the distance of the point of impact from
the bottom of the screen?

[Figure: the source, the screen one metre away, and a point B on the screen at distance X from its bottom A, with AB subtending an angle θ at the source.]

Since the particles are emitted uniformly over the complete arc of $\frac{1}{2}\pi$
radians,

$$\begin{aligned}
\Phi(X) &= p(\text{particle striking AB}) \\
        &= \text{fraction of total angle } \tfrac{1}{2}\pi \text{ subtended by AB} \\
        &= \frac{\theta}{\frac{1}{2}\pi} = \frac{2\theta}{\pi}.
\end{aligned}$$

But since $\tan\theta = \dfrac{X}{1} = X$,

then $\theta = \tan^{-1} X$.

Hence $\Phi(X) = \dfrac{2}{\pi}\tan^{-1} X$

in the interval $X \ge 0$.


Probability density function


The cumulative probability function, however, does have the disadvantage
of often being cumbersome to use in practice. From its graph, for instance,
we cannot easily picture the overall probability distribution, and we normally
use instead the probability density function (p.d.f.), derived as follows:

By definition of $\Phi(X)$, we know that

$$p(a < x < b) = \Phi(b) - \Phi(a).$$

Hence

$$p(X < x < X + \delta X) = \Phi(X + \delta X) - \Phi(X) = \frac{\Phi(X + \delta X) - \Phi(X)}{\delta X}\,\delta X.$$

Now as $\delta X \to 0$, we know that

$$\frac{\Phi(X + \delta X) - \Phi(X)}{\delta X} \to \Phi'(X).$$

Hence $p(X < x < X + \delta X) \approx \Phi'(X)\,\delta X$.

Clearly the function $\Phi'(x)$ is very useful, since it enables us to find the
probability of x lying in any small interval; and since the probability is
equal to this function multiplied by the width of the interval, the function
is referred to as the probability density function, and is rewritten $\phi(x)$.


Since $\phi(x) = \Phi'(x)$, it follows that

$$\Phi(X) = \int_L^X \phi(x)\,dx, \quad\text{where L is the least possible value of x.}$$

Furthermore, the probability that x lies in the interval [a, b] is clearly
seen to be

$$\int_a^b \phi(x)\,dx$$

and is therefore represented by the area shaded on the diagram.

[Figure: graph of $\phi(x)$ with the area between x = a and x = b shaded.]

It therefore follows, since $\Phi(x)$ is an increasing function and $\phi(x)$ is its
derivative, that $\phi(x) \ge 0$; and, since the total probability must be equal
to 1, that

$$\int_L^G \phi(x)\,dx = 1,$$

where L and G are the least and greatest possible values of x.


Mechanical analogies 


There are several analogies to probability density, and perhaps the two
most obvious are velocity and mass density:

(i) Probability is calculated from probability density in exactly the same
way as distance is calculated from velocity. For if t denotes time and v(t) is
the velocity at time t, then $\int_a^b v(t)\,dt$ is the distance travelled between
times a and b, just as $\int_a^b \phi(x)\,dx$ is the probability of x lying between a and b;
and these integrals are represented as shaded areas under their respective
graphs.

(ii) Similarly, just as density measures the rate of build-up of mass, so
probability density measures the rate of build-up of probability. Suppose
we have a metal rod of non-uniform material which has mass 1 kg, length
2 m and constant cross-section (so that we can regard the density as mass
per unit length). Then if the density is graphed as

[Figure: graph of mass density along the rod.]

we see that most of the mass is concentrated to the right of the rod’s mid-point,
just as in the equivalent probability distribution:

[Figure: graph of probability density $\phi(x)$.]

Returning to Example 1,

$$\Phi(x) = 10(x + 0.05) \quad\text{for } -0.05 \le x \le 0.05,$$

so

$$\phi(x) = \Phi'(x) = \begin{cases} 10 & \text{for } -0.05 \le x \le 0.05 \\ 0 & \text{elsewhere.} \end{cases}$$

This continuous probability distribution is therefore analogous to the
discrete rectangular distribution discussed in chapter 7.


Similarly, in Example 2,

$$\Phi(x) = \frac{2}{\pi}\tan^{-1} x \quad\text{for } x \ge 0,$$

so

$$\phi(x) = \Phi'(x) = \begin{cases} \dfrac{2}{\pi(1 + x^2)} & \text{for } x \ge 0 \\ 0 & \text{for } x < 0. \end{cases}$$
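The two relations between $\Phi$ and $\phi$ (area under $\phi$ equals a difference of $\Phi$ values, and $\phi$ is the slope of $\Phi$) can be checked numerically for Example 2. The Python sketch below is an illustration, not part of the original text: the midpoint rule, the step counts, the sample interval [0.5, 2] and the function names `cdf`, `pdf` and `integrate` are all choices made here.

```python
import math

def cdf(x):
    """Phi(x) = (2/pi) * arctan(x) for x >= 0, and 0 for x < 0."""
    return 2.0 / math.pi * math.atan(x) if x >= 0 else 0.0

def pdf(x):
    """phi(x) = Phi'(x) = 2 / (pi * (1 + x^2)) for x >= 0, and 0 for x < 0."""
    return 2.0 / (math.pi * (1.0 + x * x)) if x >= 0 else 0.0

def integrate(f, a, b, steps=10000):
    """Midpoint-rule approximation to the integral of f from a to b."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))

# Area under phi between a and b compared with Phi(b) - Phi(a).
a, b = 0.5, 2.0
area = integrate(pdf, a, b)
print(abs(area - (cdf(b) - cdf(a))) < 1e-8)

# Slope of Phi at x compared with phi(x), using a central difference.
x, dx = 1.0, 1e-6
slope = (cdf(x + dx) - cdf(x - dx)) / (2 * dx)
print(abs(slope - pdf(x)) < 1e-6)
```

Since $\Phi(1) = \frac{2}{\pi} \times \frac{\pi}{4} = \frac{1}{2}$, half of the particles strike the screen within one metre of its bottom.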


Example 3 


For a circular disc of radius 4 cm, what is the probability density function 
of the distance (r) from the centre of a point chosen at random on the disc? 
Use the p.d.f. to calculate the probability that a point lies between 2 and 
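3 cm from the centre of the disc.

In this case it is easier first of all to calculate $\Phi(r)$.

Now

$$\begin{aligned}
\Phi(r) &= p(\text{distance from centre} \le r) \\
        &= \text{fraction of total area whose distance from the centre} \le r \\
        &= \frac{\pi r^2}{\pi \times 4^2} = \frac{r^2}{16}
\end{aligned}$$

$$\Rightarrow\; \phi(r) = \Phi'(r) = \frac{r}{8} \quad (0 \le r \le 4)$$

and $\int_0^4 \phi(r)\,dr = \int_0^4 \frac{r}{8}\,dr = \left[\frac{r^2}{16}\right]_0^4 = 1$, as required.

Now, using $\phi(r)$,

$$p(2 \le r \le 3) = \int_2^3 \frac{r}{8}\,dr = \left[\frac{r^2}{16}\right]_2^3 = \frac{9}{16} - \frac{4}{16} = \frac{5}{16}.$$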
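The result of Example 3 can also be tested by simulation. The sketch below is a rough experiment in Python, not a derivation: it assumes that "chosen at random on the disc" means uniformly by area, generates such points by rejection from the enclosing square, and uses a fixed seed and an arbitrary trial count so that the run is repeatable. The function name `estimate_ring_probability` is invented here.

```python
import random

def estimate_ring_probability(inner, outer, radius=4.0, trials=200_000, seed=1):
    """Estimate p(inner <= r <= outer) for a point chosen uniformly on a disc."""
    rng = random.Random(seed)
    hits = 0
    accepted = 0
    while accepted < trials:
        # Pick a point uniformly in the bounding square; keep it only if it lies on the disc.
        x = rng.uniform(-radius, radius)
        y = rng.uniform(-radius, radius)
        r2 = x * x + y * y
        if r2 > radius * radius:
            continue
        accepted += 1
        if inner * inner <= r2 <= outer * outer:
            hits += 1
    return hits / accepted

estimate = estimate_ring_probability(2.0, 3.0)
print(estimate, 5 / 16)
```

The estimate should lie close to $\frac{5}{16} = 0.3125$, with the small discrepancy expected of any finite sample.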


Exercise 12.2a 
1 A p.d.f. is given by


x + | 
o(x) = 


forO <x <a 


0 forx < QOandx > a. 


By using the property of the p.d.f. that $\int \phi(x)\,dx = 1$, find the value of a.
Obtain an expression for the cumulative probability function. 


Calculate: (1) p(x « ἢ); (1) p(x > 1). 


2 Verify that

$$\phi(x) = \begin{cases} \dfrac{1}{x^2} & \text{for } x \ge 1 \\ 0 & \text{for } x < 1 \end{cases}$$

is a p.d.f.

Find the value of X such that p(x < X) = 0.5.
Calculate: (i) p(x > 10); (ii) p(2 < x < 4).


3 A p.d.f. is given by

$$\phi(x) = \begin{cases} \tfrac{1}{6}x & \text{for } 0 \le x \le 3 \\ \tfrac{1}{2}(4 - x) & \text{for } 3 < x \le 4 \\ 0 & \text{for } x < 0 \text{ and } x > 4. \end{cases}$$

Sketch the graph of $\phi(x)$.
Calculate:

(i) the probability that x occurs in the interval [1, 2];
(ii) the probability that x > 2.

Obtain the cumulative probability function, and hence, or otherwise,
find the median of the distribution.


4 The p.d.f. of a distribution is given by $\phi(x) = k(1 - x^2)$ for $-1 < x < 1$,
and $\phi(x) = 0$ elsewhere.

Find the value of k, and hence calculate: 
(ἢ) ρ(--ἰς <x «2; (ὦ) p(x > 5). 

Obtain an expression for the cumulative probability function of the dis- 
tribution. 


5 The p.d.f. of a distribution is given by $\phi(x) = a \sin ax$ for 0 < x < 1,
and $\phi(x) = 0$ elsewhere.

Find the value of a, and obtain an expression for the cumulative proba- 
bility function. 
Calculate: (i) p(x < 4); (1) p(s «χ < 4). 
6 The probability that a transistor in a radio lasts less than t hours is
$1 - e^{-t/2900}$. Find the p.d.f. for the lifetime of a transistor.
(i) What is the probability that a transistor lasts more than 4 000 hours?
(ii) What is the probability that a transistor ceases to function after 2 000
hours of use but before 3 000 hours?
(iii) If a radio contains 8 transistors, what is the probability that none of
them fails before 1 000 hours of use?


Parameters of a continuous distribution 


In a discrete probability distribution, the expected value and variance 
were defined by 


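$$\mu = \sum x_i p_i \quad \text{and} \quad \sigma^2 = \sum (x_i - \mu)^2 p_i = \sum x_i^2 p_i - \mu^2.$$


In a continuous distribution, the probability corresponding to $p_i$ is given
by $\phi(x)\,\delta x$.

So

$$\mu = \lim \sum x\,\phi(x)\,\delta x \quad \text{and} \quad \sigma^2 = \lim \sum (x - \mu)^2 \phi(x)\,\delta x = \lim \sum x^2 \phi(x)\,\delta x - \mu^2$$

$$\Rightarrow \mu = \int_L^G x\,\phi(x)\,dx \quad \text{and} \quad \sigma^2 = \int_L^G (x - \mu)^2 \phi(x)\,dx = \int_L^G x^2 \phi(x)\,dx - \mu^2.$$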
Example 4 


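The p.d.f. of the lifetime (t), in minutes, of atoms of a radioactive element
is given by $\phi(t) = 3e^{-3t}$. Find the expected lifetime of an atom, its variance,
and the half-life of the element (the time it takes for half of the material to
decay).

The expected lifetime, $\mu$, is given by

$$\mu = \int_0^\infty 3t\,e^{-3t}\,dt.$$

Integrating by parts,

$$\begin{aligned}
\mu &= \left[-t\,e^{-3t}\right]_0^\infty + \int_0^\infty e^{-3t}\,dt \\
&= 0 + \left[-\tfrac{1}{3}e^{-3t}\right]_0^\infty \\
&= \tfrac{1}{3} \text{ min,}
\end{aligned}$$

and

$$\begin{aligned}
\sigma^2 &= \int_0^\infty 3t^2 e^{-3t}\,dt - \tfrac{1}{9} \\
&= \left[-t^2 e^{-3t}\right]_0^\infty + \int_0^\infty 2t\,e^{-3t}\,dt - \tfrac{1}{9} \\
&= 0 + \tfrac{2}{9} - \tfrac{1}{9} \\
&= \tfrac{1}{9} \text{ min}^2.
\end{aligned}$$

Now if T is the half-life, then

$$\begin{aligned}
&\int_0^T 3e^{-3t}\,dt = \tfrac{1}{2} \\
\Rightarrow\ & \left[-e^{-3t}\right]_0^T = \tfrac{1}{2} \\
\Rightarrow\ & 1 - e^{-3T} = \tfrac{1}{2} \\
\Rightarrow\ & e^{-3T} = \tfrac{1}{2} \\
\Rightarrow\ & e^{3T} = 2 \\
\Rightarrow\ & 3T = \ln 2 \\
\Rightarrow\ & T = \tfrac{1}{3}\ln 2 = 0.23 \text{ min.}
\end{aligned}$$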
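The three answers of Example 4 can be confirmed numerically. The following is a rough sketch under stated assumptions: the integrals to infinity are cut off at an arbitrary upper limit (`UPPER`, chosen so that the neglected tail is negligible), and the helper `integrate` is a plain midpoint rule introduced here for illustration.

```python
import math


def integrate(f, a, b, n=200_000):
    """Midpoint-rule approximation to the integral of f from a to b."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def phi(t):
    """p.d.f. of the lifetime t (minutes) in Example 4."""
    return 3 * math.exp(-3 * t)


UPPER = 20.0  # assumed cut-off: the tail beyond t = 20 is of order e^(-60)

mean = integrate(lambda t: t * phi(t), 0, UPPER)
variance = integrate(lambda t: t * t * phi(t), 0, UPPER) - mean ** 2
half_life = math.log(2) / 3
prob_before_half_life = integrate(phi, 0, half_life)

print(mean, variance, half_life, prob_before_half_life)
```

The printed values should be close to 1/3, 1/9, 0.23 and 1/2 respectively.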


Exercise 12.2b 


1–5 Calculate the mean and variance of each of the probability
distributions defined in Exercise 12.2a nos. 1–5.


6 A random variable x has cumulative probability function


0 (x <a) 
®(x) = — la<x <b). 
] (x > Db) 


Find the p.d.f. @(x), and sketch the graph of @(x). Obtain the mean and 


variance of x. (c.) 


7 The probability density function, $\phi(x)$, of a random variable x, is
given by
$\phi(x) = ax(4 - x)$   0 < x < 4,
     = 0   otherwise.

Find the value of a, and hence find the mean and variance of x. What is
the probability that x lies between 0 and 1? (A.E.B.)
8 A random variable x has the cumulative probability function 

0 (x « 0) 
O(x) =< kx? (0 « x « 2) 

Ι (x > 2) 
where k is a constant. Find the mean, median, and variance of x.

(M.E.I. ) 


9 A probability distribution has the probability density function 


φῶ = 4) x <2 


ke~** x» >2, k constant. 


Find, in terms of $\lambda$, the mean, median and standard deviation of the
distribution. (M.E.I.) 


10 The probability that a light bulb lasts longer than t hours is $e^{-t/\mu}$.
Find the probability density function for the lifetime of a bulb. 


Show that the mean lifetime is μ. 


If the mean lifetime is 1 500 hours, how unlikely is it that the bulb will 
last more than 3 000 hours? 

If the manufacturer wants to ensure that less than one in a thousand 
bulbs fail before 5 hours, what is the lowest mean lifetime he can allow his 
bulbs to have? (S.M.P.) 


11 A random variable x has the probability distribution 
$\phi(x)\,dx = C\,dx$   $(-2a \le x \le -a,\ a \le x \le 2a)$
$\phi(x)\,dx = 0$   elsewhere.


Sketch the distribution. Obtain the value of the constant C, and the
standard deviation, $\sigma$, of the distribution.
Find the probabilities of obtaining values of x
(i) within one standard deviation of the mean;
(ii) within two standard deviations of the mean.
Find the value of k such that $p(|x| < k\sigma) = 0.95$. (M.E.I.)


12 A mathematical model for the fraction x of the sky covered with cloud 
(0 < x < 1) assigns to this a p.d.f.
Calculate:


(i) the value of k;
(ii) the expected fraction covered by cloud;
(iii) the probability that not more than of the sky is covered.


[Hint: Your integrations may be made easier by using the substitution
$x = \sin^2\theta$. You may assume that this substitution is valid, even though the
function to be integrated may be discontinuous at the ends of the interval
of integration.] (S.M.P.)


13 The probability density function of a distribution is given by 


ο. Χχλ  ! 
φί(χ) = ai)! (x > 0, Ainteger > 0). 


Find the expected value and variance of x. 


Sketch $\phi(x)$ when $\lambda = 2$. (M.E.I.)


Histograms 


In section 12.1 we saw how the relative frequency $g_i$ corresponded to the
probability $p_i$. Is it possible similarly therefore to find some statistical
equivalent to the probability density $\phi(x)$? We shall see that such a measure
arises from the use of what is known as a histogram to display data. 

Consider the data shown below which gives the time, in seconds, between 
succeeding vehicles passing a given point on a main road, during a traffic 
survey. (Measurements were taken to the nearest second, so that a measure- 
ment registered as one second could actually be anywhere in the range 0.5 
to 1.5 seconds. All the time intervals in the table are given in this way.) 


Time interval/s    No. of vehicles (f_i)    Rel. frequency (g_i)

0.5–1.5                   11                     0.11
1.5–2.5                   19                     0.19
2.5–3.5                   23                     0.23
3.5–4.5                   14                     0.14
4.5–5.5                   10                     0.10
5.5–10.5                  15                     0.15
10.5–20.5                  8                     0.08

Total                    100                     1.00


These results can be displayed in a relative frequency diagram: 
[Figure: relative frequency diagram for the traffic survey]


But it is immediately clear that this diagram gives a very misleading 
impression, and in order to correct this we require a diagram in which 
areas of the successive rectangles represent the corresponding relative 
frequencies. We therefore divide each relative frequency by the width of 
the corresponding interval. The resulting quantity is sometimes known as 
the relative frequency density, and the corresponding figure (whose total area 
is equal to 1) is a histogram: 


[Figure: histogram of relative frequency density against time interval]
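The division of each relative frequency by its class width can be sketched in a few lines. The class boundaries and frequencies below are those of the traffic survey table; the variable names are chosen here purely for illustration.

```python
# (lower boundary, upper boundary, number of vehicles) from the traffic survey
classes = [
    (0.5, 1.5, 11),
    (1.5, 2.5, 19),
    (2.5, 3.5, 23),
    (3.5, 4.5, 14),
    (4.5, 5.5, 10),
    (5.5, 10.5, 15),
    (10.5, 20.5, 8),
]

total = sum(f for _, _, f in classes)

# relative frequency density = relative frequency / class width
densities = [(f / total) / (hi - lo) for lo, hi, f in classes]

# the area of each rectangle is density x width, so the areas sum to 1
area = sum(d * (hi - lo) for d, (lo, hi, _) in zip(densities, classes))

for (lo, hi, _), d in zip(classes, densities):
    print(f"{lo}-{hi}: {d:.3f}")
print("total area:", area)
```

The two wide classes, which look prominent in the relative frequency diagram, have densities of only 0.03 and 0.008, which is why the histogram gives a fairer picture.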


As a further extension of the traffic survey, 1 000 times between successive 
vehicles were measured to a greater accuracy, this time correct to 0.1 s, 
and the resulting histogram was: 


[Figure: histogram of relative frequency density for the 1 000 measurements]


The general form of this histogram is obviously similar to that of its
predecessor; but, because the time intervals are now much smaller, the
outline of the histogram is correspondingly smoother. It is not difficult to see
that as the time intervals become still smaller, the outline of the histogram 
will tend to a curve, which is therefore the statistical equivalent of the graph 
of the probability density function. 


Exercise 12.2c 


1 Conduct a traffic survey similar to that described in the text on a local 
main road, and record your results on a histogram. 


2 The lifetimes (in completed weeks) of 100 mice of a certain species are
given below. 


Lifetime/weeks | 0-9 10-39 40-49 50-59 60-69 70-79 80-99 


No. of mice 8 3 7 18 4] 22 11 


Represent the data on a histogram. 


3 Each of a group of 50 schoolboys attempts to throw a javelin for the 
first time. The distances they achieve (to the nearest metre) are given below. 


Distance/m 0-19 20-29 30-34 35-39 40-49 


No. of throws 5 11 18 13 3 


Represent the data on a histogram. 


4 120 people set out on a sponsored walk to raise money for charity. The 
distances they covered are given below. 


Distance/km | less than 10 10–15 15–20 20–30 30–50


No. of walkers 12 15 25 41 27


Represent the data on a histogram. 


5 From a sample of 100 marriages taking place at a registry office, the 
ages of the husbands are recorded below. 


Age 16–20 21–25 26–30 31–40 41–60


No. of husbands 10 38 35 13 4 


Represent the data on a histogram. 


12.3 The Normal distribution 


In the last section we derived methods of using probability models to 
describe continuous data. We now look in some detail at the most important 
(though by no means the only) continuous probability distribution. We
shall first of all see, by looking at sets of statistical data, how it most fre- 
quently arises in practice. The data given in the following three examples 
is in each case continuous and each set of data is illustrated by its corres- 
ponding histogram. 


1. Masses of five-penny pieces


100 five-penny pieces were weighed and their masses were classified into 
intervals of 0.05 g. The resulting frequency table, with corresponding rela- 
tive frequencies and relative frequency densities was found to be as follows: 


Frequency; Rel. freq. 


5.45–5.50
0.6 


5.50--5.55 
5.55-5.60 0.8 
5.60-5.65 3.9 
5.65—5.70 7.4 
5.70-5.75 5.4 
5.75—5.80 9.0 
5.80—5.85 0.2 


5.85–5.90


2. Lifetimes of electric light bulbs 


The lifetimes of 200 electric light bulbs were classified into intervals of 200 
hours, as follows: 
Lifetime/h | Frequency | Rel. freq. | r.f.d. × 10³


800-1 000 


1 000—1 200 0.175 
1 200—1 400 0.425 
1 400-1 600 1.075 
1 600—1 800 1.950 
1 800-2 000 0.850 
2 000—2 200 0.375 


2 200-2 400 


[Figure: histogram of relative frequency density against lifetime/h]


3. Heights of adult males 


The heights of 1 000 adult males were measured and classified into intervals 
of 2 cm: 
Height/cm | Frequency    Height/cm | Frequency    Height/cm | Frequency

150–152   |              166–168   |              182–184   | 23
152–154   |              168–170   |              184–186   | 16
154–156   |              170–172   |              186–188   | 7
156–158   |              172–174   |              188–190   | 2
158–160   |              174–176   |              190–192   | 2
160–162   |              176–178   |              192–194   | 1
162–164   |              178–180   |              194–196   | 0
164–166   |              180–182   |              196–198   | 1
From this table it is again easy to calculate the corresponding relative
frequencies and relative frequency densities, which can be plotted as follows:


[Figure: histogram of relative frequency density against height/cm]


The three histograms appear to have the same general shape; they are 
approximately symmetrical, with most of their area concentrated in a 
central region, but tailing off fairly rapidly at each end. In the third 
example, where the total frequency and the number of class intervals are 
both greater than in the other two examples, the symmetry is more marked 
and the outline of the histogram could be approximated by a ‘bell-shaped’
curve. The histogram is reproduced below with such a curve superimposed,
and similar (though less closely-fitting) curves are also superimposed on
the other two histograms.


[Figure: the three histograms with bell-shaped curves superimposed: 1. masses; 2. times; 3. heights]


Though all have the same general bell-shape, these three curves do differ 
in certain ways: their axes of symmetry are in different positions, and one is
‘short and wide’ whilst another is ‘tall and thin’. Such differences, of 
course, follow from the differences in position and spread (however they 
may be measured — though usually by m and s) of the original sets of 
statistical data. In fact these three curves which we have superimposed 
on the histograms were carefully chosen and are defined by a single function
involving two parameters, their differences being due solely to differences
of these parameters. This function is known as the Normal probability
function, and satisfies all the conditions for a probability density function.
(The term Gaussian function is sometimes also used, after one of its earliest
and most famous investigators, and Gauss himself called it the error function, 
from the way in which it originally arose.) 

Continuous data which has a Normal p.d.f. is said to be Normally dis- 
tributed. It must be strongly emphasised, however, that in many cases data 
are only approximately described by the Normal function. In our second set 
of data, for example, the lifetimes of electric light-bulbs, though having an 
approximately bell-shaped distribution are not exactly Normal in their 
distribution. Even so, this should not be regarded in any way as ‘abnormal’, 
for although the Normal distribution is extremely important, it is far from 
being universal. Indeed, perfect ‘normality’ is highly exceptional! 

The equation of the Normal function will be derived later in the section, 
when further properties of the Normal distribution have been discussed. 
But for the present we will simply quote its probability density function as: 


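$$\phi(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-(x - \mu)^2/2\sigma^2}$$


The parameters $\mu$ and $\sigma$ can be shown, as might be suspected, to be the
mean and standard deviation of the distribution.


[Figure: the Normal probability curve $\phi(x)$]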


Properties of Normal probability curve 


1 The curve is symmetrical about the line $x = \mu$.
2 The curve is asymptotic to the x-axis, i.e., $\phi(x) \to 0$ as $x \to \pm\infty$.


3 The total area beneath the curve is equal to 1. This is, of course, a
necessary condition for $\phi(x)$ to be a p.d.f., and also accounts for the value of the
constant $1/(\sqrt{2\pi}\,\sigma)$ in the equation for $\phi(x)$.


4 Just over 68% of the area beneath the curve lies between $\mu - \sigma$ and
$\mu + \sigma$, so that approximately ⅔ of the data lies within one standard
deviation of the mean.


Similarly 95½% of the area lies between $\mu - 2\sigma$ and $\mu + 2\sigma$,
and 99¾% lies between $\mu - 3\sigma$ and $\mu + 3\sigma$.
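The areas quoted in property 4 can be checked by integrating the Normal p.d.f. numerically. This is a sketch only: the values of `mu` and `sigma` are hypothetical (any positive `sigma` gives the same areas), and the function names and the midpoint rule are choices made here for illustration.

```python
import math


def normal_pdf(x, mu, sigma):
    """Normal probability density with mean mu and standard deviation sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))


def area_within(mu, sigma, k, n=20_000):
    """Midpoint-rule area under the curve between mu - k*sigma and mu + k*sigma."""
    a, b = mu - k * sigma, mu + k * sigma
    h = (b - a) / n
    return h * sum(normal_pdf(a + (i + 0.5) * h, mu, sigma) for i in range(n))


mu, sigma = 170.0, 7.0  # hypothetical parameters, chosen only for the demonstration
areas = [area_within(mu, sigma, k) for k in (1, 2, 3)]

for k, a in zip((1, 2, 3), areas):
    print(f"within {k} s.d.: {100 * a:.2f}%")
```

Changing `mu` and `sigma` leaves the three percentages unaltered, which is the point of standardising the variable.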


The standard Normal function 


We now develop a single method of dealing with all Normal probability 
distributions, irrespective of the values of their parameters $\mu$ and $\sigma$. This is
done by standardising the variable, that is transforming it so that its new 
mean and standard deviation become respectively 0 and 1. 


Example 1 


The marks in a mathematics examination are found to be approximately 
Normally distributed with mean 56 and standard deviation 18. Standardise 
the marks and find the standardised equivalent to a mark of 70. 


(Examination marks, of course, are discrete, rather than continuous, 
variables; nevertheless, as we shall see in later examples, a continuous 
probability curve can give us a good approximation to a discrete statistical 
distribution.) 


Let us denote the examination marks by the variable x. We first reduce 
the mean to zero by subtracting 56 from each mark. So x becomes x — 56. 
As the mean of our new variable, x — 56, is now at the origin, and 
standard deviation measures spread from the mean, we reduce the standard 
deviation to 1 by dividing each new mark by 18. So x — 56 becomes 
(x — 56)/18. Therefore (x — 56)/18 is now a standardised Normal variable. 


So a mark of 70 becomes $\dfrac{70 - 56}{18} = 0.78$.


If we now look at our Normal probability curve, in geometrical terms the 
standardisation process can be regarded as two successive transformations : 


(i) a translation with vector $\begin{pmatrix} -\mu \\ 0 \end{pmatrix}$;


(ii) a two-way stretch with factor $1/\sigma$ parallel to the x-axis and factor $\sigma$
parallel to the y-axis (so that the total area under the curve remains
constant).


[Figure: the Normal curve after (i) the translation and (ii) the two-way stretch]


Hence the general standardised variable now becomes $(x - \mu)/\sigma$.


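Writing the standardised variable $(x - \mu)/\sigma$ as t, the standard Normal
function is therefore given by:


$$\phi(t) = \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}$$


Since the p.d.f., $\phi(t)$, cannot be integrated in terms of elementary functions,
in practice we use tabulated values of the equivalent cumulative probability
function $\Phi(t)$.
In geometrical terms, $\Phi(t)$ represents the shaded area on the diagram:


[Figure: the standard Normal curve $\phi(t)$, with the area to the left of t shaded]


Values of $\Phi(t)$ are tabulated below: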
 t  | Φ(t)   |  t  | Φ(t)   |  t  | Φ(t)   |  t  | Φ(t)
0.0 | 0.5000 | 1.0 | 0.8413 | 2.0 | 0.9772 | 3.0 | 0.998 65
0.1 | 0.5398 | 1.1 | 0.8643 | 2.1 | 0.9821 | 3.1 | 0.999 03
0.2 | 0.5793 | 1.2 | 0.8849 | 2.2 | 0.9861 | 3.2 | 0.999 31
0.3 | 0.6179 | 1.3 | 0.9032 | 2.3 | 0.9893 | 3.3 | 0.999 52
0.4 | 0.6554 | 1.4 | 0.9192 | 2.4 | 0.9918 | 3.4 | 0.999 66
0.5 | 0.6915 | 1.5 | 0.9332 | 2.5 | 0.9938 | 3.5 | 0.999 77
0.6 | 0.7257 | 1.6 | 0.9452 | 2.6 | 0.9953 | 3.6 | 0.999 84
0.7 | 0.7580 | 1.7 | 0.9554 | 2.7 | 0.9965 | 3.7 | 0.999 89
0.8 | 0.7881 | 1.8 | 0.9641 | 2.8 | 0.9974 | 3.8 | 0.999 93
0.9 | 0.8159 | 1.9 | 0.9713 | 2.9 | 0.9981 | 3.9 | 0.999 95
1.0 | 0.8413 | 2.0 | 0.9772 | 3.0 | 0.9986 | 4.0 | 0.999 97


Two points should be noted about the use of such a table of cumulative
probabilities:


(i) Intermediate values


More accurate statistical tables are generally available, but otherwise
linear interpolation must be used to evaluate $\Phi(t)$ for values of t not given
in the table. However, the values of $\Phi(t)$ so obtained should be regarded
as accurate only to 3 decimal places for values of t < 3, e.g.,


$$\begin{aligned}
\Phi(1.37) &\approx \Phi(1.3) + \tfrac{7}{10}\left[\Phi(1.4) - \Phi(1.3)\right] \\
&\approx 0.9032 + \tfrac{7}{10} \times 0.0160 \\
&= 0.9032 + 0.0112 \\
&\approx 0.914, \text{ to 3 d.p.}
\end{aligned}$$
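The interpolation above can be compared with a directly computed value. This is a hedged sketch: the two table entries are copied from the table in the text, the function name `interpolate` is an assumption, and the comparison value uses the standard identity Φ(t) = ½[1 + erf(t/√2)], which is not the method used in the text.

```python
import math

# two neighbouring entries taken from the table of cumulative probabilities
TABLE = {1.3: 0.9032, 1.4: 0.9192}


def interpolate(t, t0, t1, table):
    """Linear interpolation between the tabulated values at t0 and t1."""
    fraction = (t - t0) / (t1 - t0)
    return table[t0] + fraction * (table[t1] - table[t0])


approx = interpolate(1.37, 1.3, 1.4, TABLE)

# value computed from the error function, for comparison only
computed = 0.5 * (1 + math.erf(1.37 / math.sqrt(2)))

print(round(approx, 4), round(computed, 4))
```

The two values differ by less than 0.001, in line with the caution that interpolated values should not be trusted much beyond three decimal places.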


(ii) Negative values


No negative values of t are given in the table since $\Phi(-t)$ can be calculated
very easily from $\Phi(t)$.
By symmetry, the two shaded areas on the diagram are equal. The areas
represent $\Phi(-t)$ and $1 - \Phi(t)$, so it follows that:


$$\Phi(-t) = 1 - \Phi(t).$$


So, for instance,

$$\Phi(-0.6) = 1 - \Phi(0.6) = 1 - 0.7257 = 0.2743.$$


The following two examples illustrate practical applications of the 
Normal distribution: 


Example 2 


Assuming that intelligence quotients are Normally distributed with mean 
100 and standard deviation 15, calculate: 

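(i) the probability that a person chosen at random has an I.Q. between
88 and 118;

(ii) the percentage of the population with I.Q. greater than 130.

To standardise the variable in this case, we subtract 100 and divide by 15.
Therefore:


(i)
$$\begin{aligned}
p(88 < \text{I.Q.} < 118) &= p(\text{I.Q.} < 118) - p(\text{I.Q.} < 88) \\
&= \Phi\!\left(\frac{118 - 100}{15}\right) - \Phi\!\left(\frac{88 - 100}{15}\right) \\
&= \Phi(1.2) - \Phi(-0.8) \\
&= \Phi(1.2) - [1 - \Phi(0.8)] \\
&= \Phi(1.2) + \Phi(0.8) - 1 \\
&= 0.8849 + 0.7881 - 1 \\
&= 0.673.
\end{aligned}$$


(ii)
$$\begin{aligned}
p(\text{I.Q.} > 130) &= 1 - p(\text{I.Q.} < 130) \\
&= 1 - \Phi\!\left(\frac{130 - 100}{15}\right) \\
&= 1 - \Phi(2) \\
&= 1 - 0.9772 \\
&= 0.0228.
\end{aligned}$$


So approximately 2.3% of the population have I.Q. greater than 130.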
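Both parts of Example 2 can be checked with Python's standard library, which provides a `NormalDist` class in the `statistics` module. This is offered only as a cross-check of the table-based working; the variable names are illustrative.

```python
from statistics import NormalDist

# I.Q. modelled as Normal with mean 100 and standard deviation 15
iq = NormalDist(mu=100, sigma=15)

# (i) probability of an I.Q. between 88 and 118
p_between = iq.cdf(118) - iq.cdf(88)

# (ii) proportion of the population with I.Q. greater than 130
p_above_130 = 1 - iq.cdf(130)

print(round(p_between, 3), round(p_above_130, 4))
```

Here `cdf` plays the role of the cumulative probability function, with the standardisation carried out internally.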


Example 3 


Machine components have lengths which are Normally distributed with 
mean 57.2 mm. It is found that 13% of the components have lengths greater 
than 57.4 mm.
(i) What is the standard deviation of the component lengths?
(ii) What length will be exceeded by 20% of the components?


(i) We will denote the standard deviation by $\sigma$.
Now p(length < 57.4) = 1 − p(length > 57.4)

$$\Rightarrow \Phi\!\left(\frac{57.4 - 57.2}{\sigma}\right) = 1 - 0.13$$

$$\Rightarrow \Phi\!\left(\frac{0.2}{\sigma}\right) = 0.87.$$

But from the Normal probability table, $\Phi(1.13) = 0.87$.

So

$$\frac{0.2}{\sigma} = 1.13 \quad \Rightarrow \quad \sigma = \frac{0.2}{1.13} = 0.18 \text{ mm (correct to 2 d.p.).}$$


(ii) Let the required length be denoted by X;

then p(length > X) = 0.2
⇒ p(length < X) = 0.8

$$\Rightarrow \Phi\!\left(\frac{X - 57.2}{0.18}\right) = 0.8.$$

But (correct to 2 d.p.) $\Phi(0.84) = 0.8$

$$\Rightarrow \frac{X - 57.2}{0.18} = 0.84$$

⇒ X − 57.2 = 0.84 × 0.18 = 0.15
⇒ X = 57.35 mm.
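Example 3 uses the table in reverse, finding t from a given value of Φ(t). A minimal sketch of the same working, assuming Python's `statistics.NormalDist` and its `inv_cdf` method in place of the printed table (the variable names are illustrative):

```python
from statistics import NormalDist

standard = NormalDist()  # mean 0, standard deviation 1

# (i) 13% exceed 57.4 mm, so Phi((57.4 - 57.2)/sigma) = 0.87
t_87 = standard.inv_cdf(0.87)
sigma = (57.4 - 57.2) / t_87

# (ii) the length exceeded by 20% satisfies Phi((X - 57.2)/sigma) = 0.8
t_80 = standard.inv_cdf(0.80)
X = 57.2 + t_80 * sigma

print(round(t_87, 2), round(sigma, 2), round(t_80, 2), round(X, 2))
```

The unrounded value of sigma is carried through to part (ii) here, whereas the text rounds it to 0.18 first; the two answers for X agree to 2 d.p.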


Exercise 12.3a 


1 The mean weight of 100 sixth-formers at a certain school is 63 kg and 
the standard deviation is 7 kg. Assuming that the weights are approxi- 
mately normally distributed, estimate how many sixth-formers weigh: 
(i) more than 70 kg; (ii) less than 73 kg; (iii) less than 60 kg; (iv) between
61 and 65 kg. 


2 The times taken by a large group of students to complete a project are
approximately normally distributed, with mean 30 hours and standard 
deviation 4 hours. Estimate the percentage of students who: 

(i) spend more than 36 hours on the project;

(ii) finish the project in less than 25 hours;

(iii) finish the project in less than 20 hours.


3 A factory produces ball-bearings whose masses are normally distributed
with mean 45.32 g and standard deviation 0.17 g. What is the probability
that a ball-bearing chosen at random will have a mass: (i) more than
45.40 g; (ii) more than 45.50 g; (iii) less than 45.00 g; (iv) between 45.20
and 45.45 g?


4 In a certain book the frequency function for the number of words per
page may be taken as approximately Normal with mean 800 and standard 
deviation 50. If I choose three pages at random, what is the probability 
that none of them has between 830 and 845 words? (S.M.P.) 


5 The daily delivery of mail at a biological research station follows a time 
pattern conforming to the normal distribution, with a mean time of arrival 
at 8.40 a.m. and with a standard deviation of 20 minutes. 

Estimate the number of occasions during the 250 working days in the 
year when the mail arrives 
(i) before the main gates open, at 8.00 a.m.;
(ii) after the arrival of the office staff, at 8.20 a.m.;
(iii) during the Director’s daily meeting with heads of research sections
(9.00 a.m.—9.20 a.m.). (A.E.B.) 


6 The heights in centimetres of a sample of 700 six-month old babies, 
attending a post-natal clinic, are given in the following frequency table: 


Height/cm | 62 63 64 65 66 67 68 69 70 71 72 | Total


Frequency | 25 35 52 84 120 135 101 61 40 33 14 | 700 


Calculate the mean and standard deviation of the heights. 

Assuming that the heights of all such babies are normally distributed 
about this mean with this standard deviation, estimate: 
(i) the percentage of all six-month old babies of height 70 cm or more;
(ii) the height that will be exceeded by 60% of all babies of this age.


7 The table shows the frequency f with which x α-particles were radiated
from a source in a given time, the values of x being integers only.


Calculate the mean value of x and its standard deviation. Assuming that 
the sample fits a normal distribution, find the value of x that will be exceeded 
on 70% of occasions. (M.E.I.) 


8 If x is Normally distributed with mean $\mu$ and standard deviation $\sigma$,
calculate the value of $\lambda$ such that:

(i) $p(|x - \mu| < \lambda\sigma) = 0.5$;

(ii) $p(|x - \mu| < \lambda\sigma) = 0.9$.


9 Packets of soap-powder are filled in such a way that the masses of their
contents are normally distributed with mean 520 g and standard deviation 
15 g. The cartons are nominally of 500 g. 
(i) Calculate (correct to 1 d.p.) the percentage of packets that will be
‘under-weight’.

(ii) Calculate the percentage of packets containing more than 5% above
their nominal contents.

(iii) If the mean mass of soap-powder can be altered, without affecting
the standard deviation, what should the new mean be to ensure that only
2% of packets are under-weight?


10 Hens’ eggs have mean mass 60 g with standard deviation 15 g, and 
the distribution may be taken as Normal. Eggs of mass less than 45 g are 
classified as ‘small’. The remainder are divided into ‘standard’ and 
‘large’, and it is desired that these should occur with equal frequency. 
Suggest the mass at which the division should be made (correct to the 
nearest gram). (S.M.P.) 


11 In a cross-country race the times taken by competitors to complete the
course are approximately normally distributed with mean 47 minutes and 
standard deviation 9 minutes. The race organisers wish to split the com- 
petitors, on the basis of their times, into 5 equal categories, very slow, slow, 
average, fast and very fast, so that there will be an equal number of com- 
petitors (as far as possible) in each category. 

(i) What is the least time in which a competitor must complete the race
to be recorded as ‘very fast’?

(ii) Between what times will a competitor be recorded as ‘average’?

(iii) Between what times will a competitor be regarded as slow? (Give
answers in minutes, correct to 1 d.p.)


12 A machine produces components to any required length specification 
with a standard deviation of 1.40 mm. At a certain setting it produces to a 
mean length of 102.30 mm. Assuming the distribution of lengths to be 
Normal, calculate: 

(i) what percentage would be rejected as less than 100 mm long;

(ii) to what value, to the nearest 0.01 mm, the mean should be adjusted
if this rejection rate is to be 1%;

(iii) whether at the new setting more than 1% of components would
exceed 107 mm in length. (M.E.I.) 


13 Machined components are accepted if they pass through a gauge of 
1.040 cm and do not pass through a gauge of 0.960 cm. It was found that 
over a period of production the percentages of components rejected by the 
larger and smaller gauges were 3.5 and 1.5 respectively. Assuming that 
the distribution of the dimension tested is normal, find the mean and the 
standard deviation. (O.c.) 


14 A manufacturer hopes, for the sake of his reputation, to produce 
an article of such quality that not more than 5% of the articles may be 
expected to last for less than 6 months; and for reasonable economy in 
manufacturing costs, that not more than 15% may be expected to last for 
more than 20 months. He can control quality to give (assuming normal
distribution) a pre-determined average $\mu$ and standard deviation $\sigma$ for the
durability of his product. At what values of $\mu$ and $\sigma$ should he aim? (c.)


15 Skulls may be classified into 3 types according to an index based on 
their length—breadth ratio: 


type A, under 75; 
type B, between 75 and 80; 
type C, over 80. 


A large number of skulls are examined and it is found that 58% are of 
type A, 38% of type B and 4% of type C. Assuming that the length—breadth 
ratio is normally distributed, determine the mean and standard deviation 
of the length—breadth ratio of the skulls. (c.) 


16 A man leaves home at 8.00 a.m. every morning in order to arrive at 
work at 9.00 a.m. He finds that over a long period he is late once in forty 
times. He then tries leaving home at 7.55 a.m. and finds that over a similar 
period he is late once in one hundred times. Assuming that the time of his 
journey has a normal distribution, before what time should he leave home 
in order not to be late more than once in two hundred times? (S.M.P. ) 


17 In a particular school 1 460 pupils were present on a particular day.
By 8.40 a.m. 80 pupils had already arrived, and at 9.00 a.m. 12 pupils 
had not arrived but were on their way to school. By assuming that the fre- 
quency function of arrival times approximates to Normal form, use tables 
to estimate: 
(i) the time by which half of those eventually present had arrived;
(ii) the standard deviation of the times of arrival.

If registration occurred at 8.55 a.m. how many would not have arrived 
by then? 

If each school entrance permitted a maximum of 30 pupils per minute to 
enter, find the minimum number of entrances required to cope with the 
‘peak’ minute of arrival. (S.M.P.) 


18 The length of a certain mass-produced item is a normal variable with 
mean 24.3mm and standard deviation 0.4 mm. Items are rejected if 
their lengths are below 23.7 mm or above 25.2 mm. Calculate the propor- 
tion of items that are rejected, and the proportion of rejects that are too 
long. The machine making these items is adjusted with the intention of 
reducing the proportion rejected; the standard deviation of length cannot 
be altered, but the mean can. State what value of the mean leads to the 
minimum proportion of rejects, and give the value of this proportion. (c.) 


The limiting binomial distribution 


The Normal probability distribution has so far arisen as a suitable model to 
describe certain statistical data, drawn from a wide variety of sources. We 
now see how it also arises purely from considerations of probability, as the
limiting case (for large values of n) of the binomial distribution. 

The probability diagrams below illustrate the binomial distributions with 
p = 5 and ¢ for values of n = 4, 10 and 20. 


In the first case, with p = ½, the distinctive Normal bell-shape is
immediately apparent, even for relatively small values of n. And for n = 20 the
outline of the probability diagram approximates very closely to the Normal 
curve. With a symmetrical binomial distribution perhaps this is not so 
surprising, but with an apparently asymmetrical distribution, such as we 
have with p = 4, the result is much more striking. Indeed, for small values 
of n (e.g., n = 4), the probability diagram bears no resemblance at all to 
the symmetrical Normal shape; but when n = 10 we see that the diagram
is much less skew, and when n = 20 we once again have a diagram whose 
outline is very similar to the characteristic Normal shape. It is not difficult 
to envisage that as n becomes larger and larger, and the horizontal and 
vertical scales are suitably adjusted (maintaining a total area of 1), the dia- 
gram tends closer and closer to the Normal curve. The two diagrams with 
n = 20 are reproduced below, with Normal curves superimposed. 
We know, from section 12.1, that for a binomial distribution


$$\mu = np \quad \text{and} \quad \sigma = \sqrt{npq},$$


which therefore gives us the values of $\mu$ and $\sigma$ indicated on the diagram.
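The closeness of the binomial probabilities to the Normal curve with the same mean and standard deviation can be examined numerically. The sketch below takes the symmetrical case n = 20, p = ½ as an illustration; the function names are assumptions made here, and `math.comb` requires Python 3.8 or later.

```python
import math


def binomial_probability(n, p, r):
    """Probability of exactly r successes in n trials, each with probability p."""
    return math.comb(n, r) * p ** r * (1 - p) ** (n - r)


def normal_pdf(x, mu, sigma):
    """Normal probability density with mean mu and standard deviation sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))


n, p = 20, 0.5
mu = n * p                           # mean np
sigma = math.sqrt(n * p * (1 - p))   # standard deviation sqrt(npq)

# largest difference between a binomial ordinate and the Normal curve at the same r
max_gap = max(
    abs(binomial_probability(n, p, r) - normal_pdf(r, mu, sigma))
    for r in range(n + 1)
)

print(mu, round(sigma, 3), round(max_gap, 4))
```

Since each binomial rectangle has width 1, its height can be compared directly with the ordinate of the Normal curve; the largest discrepancy here is only a few thousandths.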


* Derivation of the Normal probability function


We can now derive the equation, stated earlier in this section, of the 
Normal probability function, by regarding it as the limit of the binomial 
distribution. 

The mathematics involved in this derivation is not particularly easy, 
and it should not be regarded as essential to follow the working through in 
detail. 

We have seen that as $n \to \infty$ the profile of the binomial distribution
tends:

(i) to move to the right (since $\mu = np$);

(ii) to spread (since $\sigma = \sqrt{npq}$);

and (iii) to become flatter.

Now the first of these can be controlled (or standardised) by letting

$$u = r - np,$$

and the second by letting

$$t = \frac{u}{\sqrt{npq}} = \frac{r - np}{\sqrt{npq}}.$$

But this second transformation is equivalent to diminishing all widths by a
factor $1/\sqrt{npq}$; so if the area beneath the profile is to remain 1, each of the
ordinates must be stretched by a corresponding factor $\sqrt{npq}$ which
(hopefully) will also counteract the flattening.


[Figure: the ordinate $p_r$ of the binomial distribution]


By this means we can bring each distribution into a standard position
with $\mu = 0$ and $\sigma = 1$, and so can more easily investigate the alterations
of its shape (rather than of its position and spread) as $n \to \infty$.

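If we now consider the two neighbouring ordinates $p_r$ and $p_{r+1}$ of the
original distribution, we see that each is stretched by a factor $\sqrt{npq}$, whilst
the distance between them is reduced from 1 to $1/\sqrt{npq}$. Furthermore, we
suppose that as these ordinates draw closer together, the distribution tends
to one with probability density function $\phi(t)$:


[Figure: the curve $\phi(t)$, with neighbouring ordinates at t and $t + \delta t$]


If the ordinates $p_r$ and $p_{r+1}$ are transformed into $\phi(t)$ and $\phi(t + \delta t)$, it
follows that

$$\phi(t) = \sqrt{npq}\,p_r, \qquad \phi(t + \delta t) = \sqrt{npq}\,p_{r+1} \qquad \text{and} \qquad \delta t = \frac{1}{\sqrt{npq}}.$$

Hence

$$\delta\phi = \phi(t + \delta t) - \phi(t) = \sqrt{npq}\,(p_{r+1} - p_r)$$

$$\Rightarrow \frac{\delta\phi}{\phi} = \frac{p_{r+1} - p_r}{p_r} = \frac{p_{r+1}}{p_r} - 1.$$

But

$$p_r = \binom{n}{r} p^r q^{n-r} \quad \text{and} \quad p_{r+1} = \binom{n}{r+1} p^{r+1} q^{n-r-1}.$$

So

$$\frac{\delta\phi}{\phi} = \frac{(n - r)p}{(r + 1)q} - 1 = \frac{(n - r)p - (r + 1)q}{(r + 1)q} = \frac{np - r - q}{(r + 1)q}.$$

But $r = np + \sqrt{npq}\,t$ and $\delta t = \dfrac{1}{\sqrt{npq}}$

$$\begin{aligned}
\Rightarrow \frac{1}{\phi}\frac{\delta\phi}{\delta t} &= \frac{[-\sqrt{npq}\,t - q]\sqrt{npq}}{[np + \sqrt{npq}\,t + 1]\,q} \\
&= \frac{-npt - \sqrt{npq}}{np + \sqrt{npq}\,t + 1} \\
&= \frac{-t - \sqrt{q/np}}{1 + \sqrt{q/np}\,t + 1/np}.
\end{aligned}$$


Letting $n \to \infty$ (and so $\delta t \to 0$),

$$\frac{1}{\phi}\frac{d\phi}{dt} = -t$$

$$\Rightarrow \ln \phi(t) = -\tfrac{1}{2}t^2 + \ln c$$

$$\Rightarrow \phi(t) = c\,e^{-t^2/2}.$$

Hence $\phi(t)\,dt = c\,e^{-t^2/2}\,dt$.


But $\int_{-\infty}^{\infty} \phi(t)\,dt = 1$, since $\phi(t)$ is a p.d.f., and we have also seen
(Miscellaneous problems, chapter 9) that $\int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi}$, which implies that
$\int_{-\infty}^{\infty} e^{-t^2/2}\,dt = \sqrt{2\pi}$.


Hence

$$c = \frac{1}{\sqrt{2\pi}}$$

and the limiting curve is

$$\phi(t) = \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}.$$


Finally, we can reverse the process of standardisation by the transformation

$$t = \frac{x - \mu}{\sigma}$$

and thereby obtain the probability density function of the more general
Normal distribution as


$$\phi(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-(x - \mu)^2/2\sigma^2}.$$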

We are now able to put the limiting property of the Normal distribution 
to practical use, as illustrated in the following example. 


Example 4 


Forty per cent of university mathematics students are female. Assuming 
that these young ladies are scattered randomly throughout the country, 
what is the probability that a mathematics department of 100 students 
contains at least 50 women? 

The probability model that describes this situation is a binomial one, 
with n = 100, p = 0.4, q = 0.6.

The required probability is therefore given by: 


$$p(\text{at least 50 women}) = p_{50} + p_{51} + \cdots + p_{100}$$

$$= \binom{100}{50}(0.4)^{50}(0.6)^{50} + \binom{100}{51}(0.4)^{51}(0.6)^{49} + \cdots + \binom{100}{100}(0.4)^{100}.$$


The evaluation of such a probability would be, to say the least, tedious. 
But to ease the calculation, we can now use the Normal approximation to 
this binomial distribution, with 

$\mu = np = 100 \times 0.4 = 40$
and $\sigma = \sqrt{npq} = \sqrt{100 \times 0.4 \times 0.6} = 4.9$.


Now if we denote the number of female students by f, then we require
$p(f \ge 50) = 1 - p(f \le 49)$.

Using the Normal distribution, therefore, it would seem reasonable to
say that

$$p(f \le 49) \approx \Phi\left(\frac{49 - 40}{4.9}\right),$$

but this is not quite true and the discrepancy is illustrated in the diagram
opposite.


The shaded areas represent, on the left, $p(f \le 49)$ and, on the right,
$\Phi[(49 - 40)/4.9]$. The difference between them is fairly evident, and
arises from the fact that we are using a continuous curve to approximate
to a discrete probability distribution. What we need to do is to shift the
boundary of the shaded area on our right-hand diagram half a unit to the
right. We are then considering $\Phi[(49.5 - 40)/4.9]$, which gives a
better approximation to $p(f \le 49)$. Such a modification is known as a
continuity correction. (For very large values of n the correction is insignificant,
and so may be disregarded.)


Hence $p(f \le 49) \approx \Phi\left(\dfrac{9.5}{4.9}\right)$

$= \Phi(1.94)$
$= 0.974$
$\Rightarrow \quad p(f \ge 50) = 1 - 0.974 = 0.026.$


So the probability of a department of 100 containing at least 50 women is 
0.026. 
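
The effect of the continuity correction can be seen by setting the exact binomial sum beside the two Normal approximations. This is a sketch only: the helper names are our own, and Φ is evaluated through the error function rather than from tables.

```python
import math

def normal_cdf(z):
    """Standard Normal distribution function, via the error function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

def binomial_tail_exact(n, p, k):
    """Exact p(at least k successes) for a binomial distribution."""
    q = 1 - p
    return sum(math.comb(n, r) * p**r * q**(n - r) for r in range(k, n + 1))

def binomial_tail_normal(n, p, k, corrected=True):
    """Normal approximation to p(at least k successes) = 1 - p(at most k - 1)."""
    mu = n * p
    sigma = math.sqrt(n * p * (1 - p))
    # With the correction the boundary is moved half a unit to the right.
    boundary = (k - 1) + 0.5 if corrected else (k - 1)
    return 1 - normal_cdf((boundary - mu) / sigma)

n, p, k = 100, 0.4, 50
exact = binomial_tail_exact(n, p, k)
with_correction = binomial_tail_normal(n, p, k, corrected=True)
without_correction = binomial_tail_normal(n, p, k, corrected=False)

print("exact binomial sum        ", round(exact, 4))
print("Normal, with correction   ", round(with_correction, 4))
print("Normal, without correction", round(without_correction, 4))
```

The corrected value lies noticeably closer to the exact sum than the uncorrected one.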


For a general binomial distribution, where n is large and the probability
of ‘success’ is p,

$$p(\text{at most } N \text{ successes}) \approx \Phi\left(\frac{N + \tfrac{1}{2} - np}{\sqrt{npq}}\right).$$


Exercise 12.3b 


1 If an unbiased coin is tossed 100 times, what is the probability that
(i) there will be more than 60 heads?

(ii) there will be at least 45 and at most 55 heads?

(iii) there will be fewer than 43 heads?


2 If an unbiased coin is tossed 1 000 times, what is the probability that
(i) there will be more than 600 heads?

(ii) there will be at least 450 and at most 550 heads?

(122 


3 If a ‘fair’ die is thrown 300 times, what is the probability that
(i) there will be more than 60 sixes?
(ii) there will be fewer than 45 sixes?


4 In an examination which consists of 100 questions, a student has a
probability of 0.6 of getting each question correct. The student fails the
examination if he obtains a mark less than 55, and obtains a distinction for
a mark of 68 or more. Calculate:

(i) the probability that he fails the examination;

(ii) the probability that he obtains a distinction.


5 If it is known that 30% of people are short-sighted, what is the probability
that more than 33% of a sample, chosen at random, will be short-sighted if:

(i) the sample consists of 100 people;

(ii) the sample consists of 1 000 people;

(iii) the sample consists of 10 000 people.


6 It is known that 72% of TV viewers watch a particularly popular programme.
What is the probability that in a sample of 500 viewers, chosen at
random,

(i) more than 350 watch the programme?

(ii) more than 375 watch the programme?

(iii) fewer than 340 watch the programme?


7 A playing card is drawn at random from a full pack, containing no
jokers, its suit is recorded and it is then returned to the pack; the pack is
then shuffled and the procedure repeated. In 60 such draws calculate, using
a Normal approximation, the probability that:

(i) no more than 12 hearts appear;

(ii) more than 20 spades are obtained.


8 A confectionery firm produces three types of toffees, liquorice, nut and
plain, and mixes them together in the ratio 1:2:5 before packing them
into boxes. If there are 80 toffees to a box, what percentage of boxes will
contain:

(i) more than 25 nut toffees?

(ii) less than 58 plain toffees?

(iii) more nut and liquorice than plain toffees?


9 In a certain manufacturing process a 10% rate of defectives is regarded
as just tolerable. It is decided to accept a day’s batch if in a random sample
of size n taken from it the proportion of defectives does not exceed 12%.
Calculate the value of n if there is a 0.05 probability of rejecting a batch
which is in fact producing 10% of defectives.

The percentage of defectives rises to 14%. The procedure described
above is used with a sample of size 600; calculate, to 2 decimal places, the
probability that as a result of the test the batch is accepted. (M.E.I.)


The central limit theorem 


A computer may be used to produce the binary digits 0 and 1 in a random
fashion. Then if x denotes the value of the digit so obtained,

$$p(x = 0) = p(x = 1) = \tfrac{1}{2}$$

$$\Rightarrow \quad \mu_x = 0.5 \quad \text{and} \quad \sigma_x = 0.5.$$


Now suppose that instead of looking at the digits individually, we group 
them together in sets of, say, 10. And for each group we will consider a 
new variable, y, the sum of the digits. 

y can therefore take values from 0 to 10, and since each digit can have
only one of two values, y has a binomial probability distribution with
$n = 10$, $p = \tfrac{1}{2}$, $q = \tfrac{1}{2}$.


Hence $\mu_y = np = 5$
and $\sigma_y = \sqrt{npq} = \sqrt{2.5}$.


y represents the sum of 10 digits. We now define a third variable, z, to be
the average of each group of digits, so that $z = \tfrac{1}{10}y$.

Since z is a constant fraction of y, its probability distribution will have
the same shape as that of y, shown in the diagram, but the horizontal scale
will be reduced by a factor 10.


So $\mu_z = \tfrac{1}{10}\mu_y = 0.5$

and $\sigma_z = \tfrac{1}{10}\sigma_y = \tfrac{1}{10}\sqrt{2.5} = \dfrac{0.5}{\sqrt{10}}$.

Hence $\mu_z = \mu_x$ and $\sigma_z = \dfrac{\sigma_x}{\sqrt{10}}$.

Therefore if we consider the averages of groups of 10 digits, the expected
value is the same as that of the digits taken individually, but the standard
deviation of the averages (or ‘sample means’) is a fraction $1/\sqrt{10}$ (i.e.,
about $\tfrac{1}{3}$) of the standard deviation of the individual digits.

Moreover, despite the fact that the original distribution was rectangular,
the distribution of sample means appears to be approaching the Normal
distribution. This represents a special case of one of the most important
and remarkable theorems in statistics, known as the central limit theorem,
that if samples of size n are drawn at random from any background population
(of finite variance) then, as $n \to \infty$, the distribution of the sample means,
suitably standardised, tends to the Normal distribution. (Although we generally
refer to the sample means in the central limit theorem, the theorem clearly also
holds for the sample sums.)

Having already discovered the limiting property of the binomial dis- 
tribution it is perhaps hardly surprising that the theorem should hold for 
our example of the binary digits. But suppose we consider the sample 
means of sets of 10 random digits in the range 0-9; the frequency diagram 
below illustrates the sample means, suitably grouped, of 50 samples taken 
from a table of random digits. 


[Frequency diagram: frequency plotted against sample mean, over the range 2.0 to 7.0.]


The sample means in this case do not have a distribution in any way bino- 
mial; nevertheless, as predicted by the central limit theorem, the Normal 
shape of the distribution is again apparent. 


So, in yet another way, is emphasised the importance of the Normal 
distribution. 
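
An experiment of this kind is easily repeated by machine. The sketch below is our own illustration: the seed, the number of samples and the helper name `sample_means` are arbitrary assumptions. It draws many samples of ten random digits 0-9, compares the mean and standard deviation of the sample means with the values μ and σ/√10 suggested by the argument above, and prints a crude frequency diagram.

```python
import math
import random
import statistics
from collections import Counter

def sample_means(num_samples, sample_size, rng):
    """Means of num_samples samples, each of sample_size random digits 0-9."""
    return [statistics.fmean(rng.randrange(10) for _ in range(sample_size))
            for _ in range(num_samples)]

rng = random.Random(12345)            # fixed seed so that the run is repeatable
means = sample_means(20000, 10, rng)

# Mean and standard deviation of a single random digit 0-9.
digit_mean = 4.5
digit_sd = math.sqrt(sum((d - digit_mean) ** 2 for d in range(10)) / 10)

print("mean of sample means:", round(statistics.fmean(means), 3),
      " expected:", digit_mean)
print("s.d. of sample means:", round(statistics.pstdev(means), 3),
      " expected:", round(digit_sd / math.sqrt(10), 3))

# Crude frequency diagram, grouping the sample means in classes of width 0.5.
classes = Counter(math.floor(m * 2) / 2 for m in means)
for lower in sorted(classes):
    print(f"{lower:4.1f} | " + "*" * (classes[lower] // 100))
```

The rows of asterisks should rise and fall roughly symmetrically about 4.5.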


Exercise 12.3c 


1 Using a table of random digits (0-9), obtain the sample means of 100 
sets of ten digits. Display these sample means on a frequency diagram 
similar to that shown in the text. 


2 What would have been the values of the expected value and standard
deviation of the sample mean of the binary digits, described in the text, if
the sample had been: (i) of size 25; (ii) of size 100?


3 A computer is now programmed to produce randomly the digits 0, 2.
Calculate the values of $\mu$ and $\sigma$ for this simple rectangular distribution,
and hence obtain the expected value and standard deviation of the mean
of a sample of size 10.


4 The digits 1, 2, 3, 4 are produced randomly. By applying the results 
obtained for binary digits, calculate the expected value and standard 
deviation of the mean of a sample of 100 such numbers. Hence, using the 
central limit theorem, estimate the probability that the mean of such a 
sample exceeds 2.7. 


12.4 The Poisson distribution 


In 12.3 we discovered that, for large values of n, the Normal distribution
gave us a good approximation to the binomial distribution, and is in fact
the limit of the binomial distribution as $n \to \infty$.

Consider, however, the binomial probabilities with n = 1000 and 
p = 0.001 and their Normal approximations. 


[Diagram: the binomial probabilities for n = 1000, p = 0.001, compared with their Normal approximation.]


Here, despite the fact that n is very large, the Normal distribution seems
to give us a poor approximation. This apparent contradiction of our previous
result arises from the fact that in section 12.3 we assumed that as n
became very large, so did the mean $\mu$ (= np), with the effect that the
binomial distribution became approximately symmetrical about this mean.
For the distribution illustrated above, however, this is not so; for, even
though n is large,


$$\mu = np = 1000 \times 0.001 = 1,$$


a relatively small value, and the distribution cannot be symmetrical
about $\mu$.

In fact the limiting Normal property of the binomial distribution holds
only if

$$\mu = np \to \infty \quad \text{as } n \to \infty$$

(an assumption which was made in the last section). In many cases, of
course, this is so, but if np is constant as $n \to \infty$, then the limit of the
binomial is quite different, as we shall now show.


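We know that for the general binomial distribution, with $p = \mu/n$ and $q = 1 - \mu/n$,

$$p_r = \binom{n}{r} p^r q^{n-r} = \frac{n(n-1)\cdots(n-r+1)}{r!}\left(\frac{\mu}{n}\right)^r\left(1 - \frac{\mu}{n}\right)^{n-r}$$

or

$$p_r = \frac{\mu^r}{r!}\left(1 - \frac{1}{n}\right)\left(1 - \frac{2}{n}\right)\cdots\left(1 - \frac{r-1}{n}\right)\left(1 - \frac{\mu}{n}\right)^{-r}\left(1 - \frac{\mu}{n}\right)^{n}.$$

Now since r is finite, as $n \to \infty$

$$\left(1 - \frac{1}{n}\right)\left(1 - \frac{2}{n}\right)\cdots\left(1 - \frac{r-1}{n}\right)\left(1 - \frac{\mu}{n}\right)^{-r} \to 1$$

and

$$\left(1 - \frac{\mu}{n}\right)^n = 1 - n\,\frac{\mu}{n} + \frac{n(n-1)}{2!}\,\frac{\mu^2}{n^2} - \cdots \quad \text{(a finite series)}$$

$$\to 1 - \mu + \frac{\mu^2}{2!} - \frac{\mu^3}{3!} + \cdots = e^{-\mu} \quad \text{(an infinite series)}.$$

Hence

$$p_r \to \frac{\mu^r}{r!}\,e^{-\mu},$$

and the distribution so defined is known as the Poisson probability distribution.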


The distribution can also be obtained by considering its probability 
generator: 


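The probability generator of the general binomial distribution is given by

$$G(t) = (q + pt)^n$$

$$= (1 + p(t - 1))^n \quad \text{since } p + q = 1$$

$$= \left(1 + \frac{\mu(t - 1)}{n}\right)^n \quad \text{since } \mu = np.$$

Now we know (Miscellaneous problems, chapter 9) that if x is finite,

$$\left(1 + \frac{x}{n}\right)^n \to e^x \quad \text{as } n \to \infty.$$

So $G(t) \to e^{\mu(t - 1)}$ as $n \to \infty$.

Now

$$e^{\mu(t - 1)} = e^{\mu t} \times e^{-\mu} = e^{-\mu}\left(1 + \mu t + \frac{\mu^2 t^2}{2!} + \cdots + \frac{\mu^r t^r}{r!} + \cdots\right)$$

$$= e^{-\mu} + \mu e^{-\mu}\,t + \cdots + \frac{\mu^r}{r!}\,e^{-\mu}\,t^r + \cdots$$

so the coefficient of $t^r$ gives $p_r = \dfrac{\mu^r}{r!}\,e^{-\mu}$.
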
Returning to the original binomial distribution, if we now use the
Poisson distribution with $\mu = 1$, we obtain probabilities which are very
good approximations to the binomial, and these equivalent binomial and
Poisson probabilities are set out in the table below.


Probability Binomial Poisson 


So we now have, under different circumstances, another limit to the 
binomial distribution, but with the important difference that this limiting 
distribution, like the binomial itself, but unlike the Normal distribution, 
is discrete. 

The probability diagrams for the Poisson distribution with μ = 0.5, 
2 and 5 respectively, are illustrated overleaf. 


It can be seen from these diagrams that, as $\mu$ becomes larger, the Poisson
distribution becomes more symmetrical in shape and, like the binomial,
approximates to the Normal distribution.


Calculation of Poisson probabilities


In the actual computation of probabilities, the Poisson distribution has
one big advantage over the Normal and binomial distributions; it involves
the use of only one parameter, $\mu$, as opposed to two in each of the other
cases ($\mu$, $\sigma$ for the Normal and n, p for the binomial). And though the
formula for $p_r$ may seem at first glance rather daunting, in practice the
calculation of probabilities can be carried out very conveniently:


Suppose $\mu = 3$,
then $p_0 = e^{-3} = 0.0498$
$p_1 = 3e^{-3} = 3p_0 = 0.1494$
$p_2 = \dfrac{3^2}{2!}e^{-3} = \tfrac{3}{2}p_1 = 0.2241$
$p_3 = \dfrac{3^3}{3!}e^{-3} = \tfrac{3}{3}p_2 = 0.2241$
$p_4 = \dfrac{3^4}{4!}e^{-3} = \tfrac{3}{4}p_3 = 0.1681$
and $p_{r+1} = \dfrac{3}{r+1}\,p_r$, so in general $p_{r+1} = \dfrac{\mu}{r+1}\,p_r$.
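
The recurrence lends itself directly to machine computation. A minimal sketch follows; the function name `poisson_probabilities` is our own choice.

```python
import math

def poisson_probabilities(mu, r_max):
    """Return [p_0, p_1, ..., p_r_max], building each probability from the
    previous one by the relation p_(r+1) = mu / (r + 1) * p_r."""
    probs = [math.exp(-mu)]                     # p_0 = e^(-mu)
    for r in range(r_max):
        probs.append(probs[-1] * mu / (r + 1))
    return probs

for r, p in enumerate(poisson_probabilities(3, 4)):
    print(r, round(p, 4))
```

Because no rounding takes place between steps, the last decimal place may differ by one unit from values obtained by hand with four-figure intermediate results.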


Mean and variance of Poisson distribution 
For the binomial distribution,

$\mu = np$
and $\sigma^2 = npq = np(1 - p)$.


Now we can regard the Poisson distribution as the limit of the binomial
distribution as $n \to \infty$ and $np = \mu$ (so that $p \to 0$).


Hence mean $= \mu$


and variance $= \lim np(1 - p) = \mu$.




Applications of the Poisson distribution 


We have derived the Poisson distribution as a limiting case of the binomial 
distribution. But it arises more fundamentally in its own right, whenever 
there is a continuous variable — say time or distance — and discrete events 
occur randomly and independently. In such a case, the number of events in a 
given interval has a Poisson distribution. (By imagining the continuous 
variable split into a very large number of equal small intervals, the reader 
can verify for himself that these conditions are consistent with those of a 
limiting binomial distribution.) Examples of such situations, in which the 
Poisson distribution is applicable, are provided by 


flaws occurring in lengths of steel piping, 
people arriving in a shop queue in a certain time interval, 
and radio-active particles being emitted from a source in some time interval. 


Consider the first of these examples: 


Example 1 


It is found experimentally that there is an average of one flaw in every 
2 metres of a certain type of steel piping. If the piping is cut into 1-metre 
lengths what is the probability of 0, 1, 2 flaws in such a length? Find the 
equivalent probabilities if the pipe is cut into 4-metre lengths. 


(i) For a 1-metre length of pipe, the expected number of flaws, $\mu = \tfrac{1}{2}$.
Hence $p_0 = e^{-1/2} = 0.6065$.


Then using the relationship we have recently obtained between successive
Poisson probabilities,

$p_1 = \tfrac{1}{2}p_0 = 0.3032$

and $p_2 = \dfrac{\tfrac{1}{2}p_1}{2} = 0.0758$.

(ii) For 4-metre lengths of pipe, $\mu = 2$.
Hence $p_0 = e^{-2} = 0.1353$
$p_1 = 2p_0 = 0.2706$
$p_2 = \tfrac{2}{2}p_1 = 0.2706$.


It was quite clear that Example 1 should be tackled by means of the
Poisson distribution. By comparison, in the following example we can use
either a binomial or a Poisson probability model, and the equivalent
results for the two models are compared. In Example 3 we fit a Poisson
model to experimental data.


Example 2


A firm of wholesale fruit distributors has found that on average one apple 
in 50 is bruised on arrival from the growers. If the apples arrive in cartons 
of 100, calculate the probabilities of a carton having 0, 1, 2, 3, more than 
3 bruised apples. 

Using a binomial model, $n = 100$ and $p = \tfrac{1}{50}$.

Using a Poisson model, $\mu = 100 \times \tfrac{1}{50} = 2$.


The calculations for the two models are carried out simultaneously:


binomial                                                                    Poisson

$p_0 = (\tfrac{49}{50})^{100} = 0.1326$                                     $p_0 = e^{-2} = 0.1353$
$p_1 = 100\,(\tfrac{1}{50})(\tfrac{49}{50})^{99} = 0.2707$                  $p_1 = 2p_0 = 0.2706$
$p_2 = \binom{100}{2}(\tfrac{1}{50})^2(\tfrac{49}{50})^{98} = 0.2734$       $p_2 = \tfrac{2}{2}p_1 = 0.2706$
$p_3 = \binom{100}{3}(\tfrac{1}{50})^3(\tfrac{49}{50})^{97} = 0.1823$       $p_3 = \tfrac{2}{3}p_2 = 0.1804$
$p(r > 3) = 0.1410$                                                         $p(r > 3) = 0.1431$


The results clearly are very similar but, since its calculations are so much 
easier, we would usually adopt the Poisson model. 
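
The two columns can be regenerated mechanically. The sketch below uses helper names of our own (`binomial_pmf`, `poisson_pmf`, `compare`) and evaluates both models without intermediate rounding, so the Poisson figures may differ in the last place from those obtained by the step-by-step hand method.

```python
import math

def binomial_pmf(n, p, r):
    """Binomial probability of exactly r successes."""
    return math.comb(n, r) * p**r * (1 - p)**(n - r)

def poisson_pmf(mu, r):
    """Poisson probability of exactly r events."""
    return math.exp(-mu) * mu**r / math.factorial(r)

def compare(n, p, r_max):
    """Rows (r, binomial p_r, Poisson p_r) for r = 0..r_max, taking mu = np."""
    mu = n * p
    return [(r, binomial_pmf(n, p, r), poisson_pmf(mu, r))
            for r in range(r_max + 1)]

rows = compare(100, 1 / 50, 3)
for r, b, q in rows:
    print(f"p_{r}: binomial {b:.4f}   Poisson {q:.4f}")

binomial_tail = 1 - sum(b for _, b, _ in rows)
poisson_tail = 1 - sum(q for _, _, q in rows)
print(f"p(r > 3): binomial {binomial_tail:.4f}   Poisson {poisson_tail:.4f}")
```

Calling `compare(1000, 0.001, 5)` gives the corresponding comparison for the distribution with n = 1000 and p = 0.001 discussed earlier in this section.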


Example 3 


The following classical data (without which any account of Poisson proba- 
bility would be incomplete!) gives the number of deaths by horse-kick of 
Prussian cavalrymen in ten corps during the years 1875-1894. 


No. of deaths per corps per year |   0    1    2    3    4

Frequency of occurrence          | 109   65   22    3    1


We will fit a Poisson probability model to the data, and calculate the equi- 
valent theoretical frequencies. 


The mean number of deaths per year is given by

$$\frac{109 \times 0 + 65 \times 1 + 22 \times 2 + 3 \times 3 + 1 \times 4}{200} = \frac{122}{200} = 0.61.$$


We will use this mean value as our parameter $\mu$ in the Poisson distribution.
To obtain the theoretical (or expected) frequencies, we multiply the
probabilities by the total frequency, 200, and round off to the nearest whole
number.


No. of deaths        |    0       1       2       3       4

Observed frequency   |  109      65      22       3       1

Probability          | 0.543   0.331   0.101   0.021   0.003

Expected frequency   |  109      66      20       4       1
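
The fitting procedure just described (mean of the data, Poisson probabilities, multiplication by the total frequency) can be sketched as follows. The variable names are our own; the observed frequencies are those quoted in the example.

```python
import math

# Observed data: number of deaths per corps per year -> frequency.
observed = {0: 109, 1: 65, 2: 22, 3: 3, 4: 1}

total = sum(observed.values())
mean = sum(deaths * freq for deaths, freq in observed.items()) / total

# Poisson probabilities with mu equal to the sample mean,
# scaled up by the total frequency to give expected frequencies.
probability = {r: math.exp(-mean) * mean**r / math.factorial(r) for r in observed}
expected = {r: total * probability[r] for r in observed}

print("mean =", mean)
for r in observed:
    print(r, observed[r], round(probability[r], 3), round(expected[r]))
```

Each printed row gives the number of deaths, the observed frequency, the Poisson probability and the rounded expected frequency.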


* Alternative derivation of Poisson distribution 


To complete our account of the Poisson distribution, we now derive it 
directly from our basic assumptions. 

Let us suppose that a series of events, such as the flashes of a Geiger
counter, occur independently; and yet, over a long interval, at a uniform
rate, so that in an interval of unit length the expected number of flashes is $\lambda$.
Then in a very short interval $\delta t$, the probability

of a single flash is $\lambda\,\delta t$,

of two or more flashes is negligible (effectively 0),
and of no flashes is $1 - \lambda\,\delta t$.

We now denote the probability of having r flashes in the first t units of
time by $p_r(t)$. Then, if we consider a further small interval $\delta t$, it is clear that

p(r flashes in time $t + \delta t$)
= p(r flashes in time t and no flashes in interval $\delta t$)
+ p(r − 1 flashes in time t and 1 flash in interval $\delta t$)

$$\Rightarrow \quad p_r(t + \delta t) = p_r(t)(1 - \lambda\,\delta t) + p_{r-1}(t)\,\lambda\,\delta t$$

$$\Rightarrow \quad \frac{p_r(t + \delta t) - p_r(t)}{\delta t} + \lambda p_r(t) = \lambda p_{r-1}(t).$$

Letting $\delta t \to 0$, we obtain

$$p_r'(t) + \lambda p_r(t) = \lambda p_{r-1}(t)$$

$$\Rightarrow \quad \frac{d}{dt}\left[e^{\lambda t} p_r(t)\right] = \lambda e^{\lambda t} p_{r-1}(t)$$

$$\Rightarrow \quad \left[e^{\lambda t} p_r(t)\right]_0^t = \lambda \int_0^t e^{\lambda t} p_{r-1}(t)\,dt.$$

But $p_r(0) = 0$ if $r \ne 0$,

so

$$e^{\lambda t} p_r(t) = \lambda \int_0^t e^{\lambda t} p_{r-1}(t)\,dt. \qquad (1)$$

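Furthermore,
p(0 flashes in time $t + \delta t$)
= p(0 flashes in time t and 0 flashes in interval $\delta t$)

$$\Rightarrow \quad p_0(t + \delta t) = p_0(t)(1 - \lambda\,\delta t)$$

$$\Rightarrow \quad \frac{p_0(t + \delta t) - p_0(t)}{\delta t} + \lambda p_0(t) = 0.$$

Letting $\delta t \to 0$,

$$p_0'(t) + \lambda p_0(t) = 0$$

$$\Rightarrow \quad \frac{d}{dt}\left[e^{\lambda t} p_0(t)\right] = 0$$

$$\Rightarrow \quad e^{\lambda t} p_0(t) = p_0(0).$$

But we certainly start with 0 flashes, so $p_0(0) = 1$.

Hence

$$e^{\lambda t} p_0(t) = 1. \qquad (2)$$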


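If we now put $P_r(t) = e^{\lambda t} p_r(t)$, we can rewrite equations (1) and (2) as

$$P_r(t) = \lambda \int_0^t P_{r-1}(t)\,dt \quad \text{and} \quad P_0(t) = 1.$$

So

$$P_1(t) = \lambda \int_0^t 1\,dt = \lambda t \quad \Rightarrow \quad p_1(t) = \lambda t\,e^{-\lambda t};$$

$$P_2(t) = \lambda \int_0^t \lambda t\,dt = \frac{\lambda^2 t^2}{2} \quad \Rightarrow \quad p_2(t) = \frac{\lambda^2 t^2}{2!}\,e^{-\lambda t};$$

$$P_3(t) = \lambda \int_0^t \frac{\lambda^2 t^2}{2}\,dt = \frac{\lambda^3 t^3}{3!} \quad \Rightarrow \quad p_3(t) = \frac{\lambda^3 t^3}{3!}\,e^{-\lambda t};$$

and, more generally,

$$P_r(t) = \frac{\lambda^r t^r}{r!} \quad \Rightarrow \quad p_r(t) = \frac{\lambda^r t^r}{r!}\,e^{-\lambda t}.$$

Finally, regarding t as fixed, we can write $p_r(t)$ as $p_r$; and since the
expected number of flashes in this interval is $\lambda t$, we write $\mu = \lambda t$ and obtain

$$p_r = \frac{\mu^r}{r!}\,e^{-\mu}.$$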
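
The assumptions of this derivation can be imitated directly on a computer. In the sketch below (our own illustration; the rate, duration, number of steps, number of trials and seed are all arbitrary assumptions) the interval is cut into many short steps, each containing a flash with probability λδt and never more than one, and the observed proportions of 0, 1, 2, ... flashes are set beside the Poisson probabilities with μ = λt.

```python
import math
import random
from collections import Counter

def count_flashes(rate, duration, steps, rng):
    """Number of flashes when [0, duration] is cut into `steps` short
    intervals, each holding one flash with probability rate * dt."""
    dt = duration / steps
    return sum(1 for _ in range(steps) if rng.random() < rate * dt)

rng = random.Random(2024)             # fixed seed so that the run is repeatable
rate, duration, steps, trials = 2.0, 1.5, 300, 5000
counts = Counter(count_flashes(rate, duration, steps, rng) for _ in range(trials))

mu = rate * duration                  # expected number of flashes, lambda * t
for r in range(8):
    simulated = counts[r] / trials
    poisson = math.exp(-mu) * mu**r / math.factorial(r)
    print(f"r = {r}: simulated {simulated:.3f}   Poisson {poisson:.3f}")
```

The agreement is only approximate, both because the number of trials is finite and because the steps are short rather than infinitesimal.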


Exercise 12.4 


1 It is known that 0.1% of all people react adversely to a certain type of
drug. What is the probability that out of a sample of 1 000 people:

(i) none will react to the drug;

(ii) just one person will react;

(iii) more than two people will react?


2 A hiker on a walking holiday in a country area estimates that in walking
150 kilometres he has passed a total of 60 public houses. Assuming that
these public houses are distributed fairly randomly about the countryside,
what is the probability of his being able to quench his thirst:

(i) within the next kilometre;

(ii) within the next five kilometres?


3 During a certain period of the day, an average of four people enter a
supermarket each minute. Calculate the probability that:

(i) no-one enters during a particular minute;

(ii) three people enter during a particular minute;

(iii) five people enter during a particular minute;

(iv) no-one enters during a two-minute period.


4 Over a period of time, the number of serious road accidents in a city
averages 1.8 per day. What is the probability of there being:

(i) no accidents on a given day;

(ii) more than two accidents on a given day;

(iii) fewer than five accidents in a three-day period?


5 The average proportion of bad eggs in an egg packing station is one in
2 000. The eggs are packed in boxes containing six eggs each.
(i) Evaluate to two significant figures the probability that a box contains
one bad egg.
(ii) A housewife complains if she obtains two or more boxes, with one bad
egg each, per 100 boxes. What is the probability that she complains?
(M.E.I.)


6 A manufacturer of electrical components knows that he can expect
0.2% of his products to be faulty. What is the maximum number of components
he can pack into a box, if he wishes to ensure that at least 90% of
the boxes contain no defectives?


7 During the Second World War 535 flying bombs fell on South London.
The distribution of the numbers of hits in 576 areas, each of 0.25 km², is
given in the table. Show that the aim was effectively random within the
South London area by calculating the expected frequencies for a Poisson
distribution with the same average number of hits.


No. of hits |   0     1     2     3     4     5   6 or more

Frequency   | 229   211    93    35     7     1     0
(M.E.I.)


8 The incidence of plumbing repairs in 78 council houses over a period 
of ten years is given in the table. Do the data support the view that there 
are good and bad tenants or the view that there are lucky and unlucky 
tenants? 


No. of repairs 


No. of houses $13. 16-16 10 9 ὁ 511 


(Use a Poisson probability model and compare theoretical and observed 
frequencies. ) (M.E.I.) 


9 At a stage in the mass production of lamp holders, random samples, 
each of 40 articles, are examined and the number of defective articles 
recorded. The numbers of defective articles in each of 200 samples are 
shown in the following frequency table: 


Defectives 0 ] 2 3 4 5 6 7 


No. of samples 29: “06: 42 42° 95... 7: ὦ: 1 


Find the mean number of defectives per sample and the variance. Give 
reasons for thinking that the distribution approximates to a Poisson dis- 
tribution. 

Show that on this assumption there is a probability of about 5% of a 
sample containing more than four defectives and a probability of less than 
1% of a sample containing more than six defectives. (ο.α.) 


10 Spot blemishes occur randomly along a steel wire. The number
counted in consecutive centimetre lengths had the following distribution:


No. of spots 0 Ι 2 3 4 5 6 7 


Frequency 102. 150) 112 56 2] 6] 


If the count had been over consecutive 2 cm lengths, estimate the fre- 
quency of occurrence of 3 spots per 2 cm. (You may assume that the fre- 
quencies are close to a Poisson distribution. ) (S.M.P.) 


11 The following are the numbers of breakdowns of a machine in 40
12-hour periods:
0401100220110110132021221100200
kOe 9.1 19 10

Calculate the mean number of breakdowns per 12-hour period and, using
this value for $\mu$ in a Poisson probability model, calculate the probabilities
of 0, 1, 2, 3, 4 breakdowns. Hence estimate the number of breakdown-free
days in a year (taken to consist of 300 12-hour working days). Also estimate
the number of days in the year when there will be more than 2 breakdowns.


12 Using probability generators, show that if x, y have independent
Poisson distributions with means $\mu_x$, $\mu_y$ then x + y has a Poisson
distribution with mean $\mu_x + \mu_y$.


13 Instruments A and B are set to record radiation from separate and 
independent sources. The number of particles recorded by the instruments 
in one second have Poisson distributions with means 2 and 4 respectively. 
Calculate the probabilities that: 

(i) at least one particle is recorded on one of the two instruments in a
second;

(ii) a total of 4 or more particles is recorded on the two instruments in a
second. (ο.α.) 


14 A factory produces high-quality china dishes. Flaws occur in the 
dishes themselves with a Poisson probability distribution, mean p, and 
flaws occur independently in the glazing, again with a Poisson probability 
distribution with mean μ. A dish is immediately rejected if it has a flaw of 
either kind. What is the probability that a dish is rejected? Hence find the 
probability that a rejected dish only had flaws in the glazing (expressing 
your answer in as simple a form as possible. ) 


15 In a simplified probability model of the service in a barber’s shop it is
supposed that all haircuts take exactly six minutes and that a fresh batch of
customers arrives at six-minute intervals. The number of customers in a
batch is described by a Poisson probability function, the mean number
being 3. Any customer who cannot be served instantly goes away and has
his hair cut elsewhere. The shop is open for 40 hours a week. Calculate the
theoretical frequencies with which batches of 0, 1, 2, 3, 4, 5 and more than
5 customers will arrive per week.

The proprietor reckons that it costs him £25 a week to staff and maintain
each chair in his shop, and he charges 25 p for each haircut. Calculate his
expected weekly profit if he has (i) 3, (ii) 4, (iii) 5 chairs. (S.M.P.)


16 Show that when an event occurs randomly in time (i.e., the number of
events occurring in a fixed interval of time has a Poisson distribution with
mean $\lambda$) then the time interval between successive occurrences has the
exponential distribution:


$$p(x) = \lambda e^{-\lambda x} \quad (x > 0).$$

The mean number of calls per minute made on a certain telephone
exchange is two. Find:


(i) the probability that in any given one minute interval not more than
one call will be made;

(ii) the probability that in any given half-minute interval exactly two
calls will be made;

(iii) the probability that there will be no call in the half-minute immediately
following a call. (M.E.I.)


*12.5 Samples 


In practical statistics we rarely consider individual readings or measure- 
ments in isolation ; the normal procedure is to use a sample, or set of readings, 
and from these readings to obtain a statistic, or representative value. The 
most common, and generally most useful, statistic is the mean. Intuitively 
we expect it to give us a good approximation to the expected value (or 
mean), μ, of the background population, and we also feel that somehow the 
mean is a more reliable estimate of $\mu$ than is a single reading. We will now
give some substance to these intuitive notions by extending our ideas of 
expectation to cover the expected value and variance of the mean of a 
sample. But before examining the mean itself, we must first establish some 
basic results in expectation algebra. 


Expectation algebra 

Suppose that x is a random variable,† and takes values

$$x_1, x_2, x_3, \ldots$$

with probabilities $p_1, p_2, p_3, \ldots$


We now (as in chapter 7) use the notation E[x] for the expected value, or
expectation of x, so that


$$E[x] = \sum x_i p_i = \mu.$$

Furthermore, we denote the variance of x by V[x] so that

$$V[x] = \sum (x_i - \mu)^2 p_i = \sum x_i^2 p_i - \mu^2$$

$$= E[(x - \mu)^2] = E[x^2] - \mu^2.$$

Hence $E[x] = \mu$
and $V[x] = E[(x - \mu)^2] = E[x^2] - \mu^2$.


We now let a be constant, so that ax takes values $ax_1, ax_2, \ldots$ with
probabilities $p_1, p_2, \ldots$


Hence

$$E[ax] = \sum a x_i p_i = a \sum x_i p_i = aE[x].$$

† It can be argued that what is generally referred to as a random variable is neither random nor
variable. At this stage, however, it is undesirable to become too involved in such discussions,
and since the term is in widespread use we shall continue to employ it.

Similarly

$$V[ax] = \sum (ax_i - a\mu)^2 p_i = \sum a^2 (x_i - \mu)^2 p_i = a^2 \sum (x_i - \mu)^2 p_i = a^2 V[x].$$

Summarising,


$$E[ax] = aE[x]$$

$$V[ax] = a^2 V[x]$$


The same results can be shown to hold for a continuous variable, using 
the probability density function, and integration instead of summation. 


Example 1 


If measurements are made in centimetres, the expected value and variance 
of length of leaf of a certain type of rhododendron are 5.5 cm and 2.4 cm? 
respectively. What will the expected value and variance be if measurements 
are made in millimetres? 


Let x denote the leaf length in centimetres,
then E[x] = 5.5 and V[x] = 2.4.


The length in millimetres is given by 10x, so the expected length in millimetres
is given by E[10x]


and E[10x] = 10E[x]
= 55.

Similarly V[10x] = 10²V[x]
= 240.


These results of course are hardly surprising, and simply fortify the 
above expectation results with common sense. 


Two random variables 


Let x, y be two random variables, each of them discrete. (An equivalent
analysis is also possible for continuous variables, but it involves the use of
more advanced mathematics, with which the reader is probably not yet
acquainted; it can be found in most specialised probability texts.)


Let x take possible values $x_1, x_2, \ldots, x_i, \ldots$
and y take possible values $y_1, y_2, \ldots, y_j, \ldots$


(a finite or infinite number in each case.)


We define the joint probability distribution of x and y by $p(x = x_i, y = y_j)$,
which can be written $p(x_i, y_j)$ or more simply $p_{ij}$.


Then $p(x_i) = \sum_j p_{ij}$, summing over all possible values of j,

and $p(y_j) = \sum_i p_{ij}$, summing over all possible values of i.

This situation can now be summarised in the table: 


Using this notation, let us now investigate the expected value and variance 
of x + y, the sum of the two random variables. 

Considering the values of x + y which arise from all the combinations of
values of x and y,

$$E[x + y] = \sum_i \sum_j (x_i + y_j)\,p_{ij}$$

$$= \sum_i \sum_j x_i p_{ij} + \sum_i \sum_j y_j p_{ij}$$

$$= \sum_i \Big(x_i \sum_j p_{ij}\Big) + \sum_j \Big(y_j \sum_i p_{ij}\Big)$$

$$= \sum_i x_i\,p(x_i) + \sum_j y_j\,p(y_j)$$

$$= E[x] + E[y].$$

Also

$$E[xy] = \sum_i \sum_j p_{ij}\,x_i y_j.$$

But x, y are independent $\Rightarrow p_{ij} = p(x_i)\,p(y_j)$

$$\Rightarrow \quad E[xy] = \sum_i \sum_j p(x_i)\,p(y_j)\,x_i y_j = \sum_i p(x_i)\,x_i \sum_j p(y_j)\,y_j = E[x]\,E[y].$$

Moreover,

$$V[x + y] = E[(x + y)^2] - (E[x + y])^2$$

$$= E[x^2 + 2xy + y^2] - (E[x] + E[y])^2$$

$$= \{E[x^2] - (E[x])^2\} + \{E[y^2] - (E[y])^2\} + 2\{E[xy] - E[x]E[y]\}$$

$$= V[x] + V[y] + 2\,\mathrm{Cov}[x, y],$$

where Cov[x, y] = E[xy] − E[x]E[y] and is known as the covariance of
x and y, which is a measure of their interdependence.
In particular, we have seen that x, y are independent

$$\Rightarrow \quad E[xy] = E[x]E[y]$$

$$\Rightarrow \quad \mathrm{Cov}[x, y] = 0$$

$$\Rightarrow \quad V[x + y] = V[x] + V[y].$$


Summarising: in general

$$E[x + y] = E[x] + E[y]$$

$$V[x + y] = V[x] + V[y] + 2\,\mathrm{Cov}[x, y]$$

but if x, y are independent

$$V[x + y] = V[x] + V[y].$$

Similarly,

$$E[x - y] = E[x] - E[y]$$

$$V[x - y] = V[x] + V[y] - 2\,\mathrm{Cov}[x, y];$$

and note that for independent x, y,

$$V[x - y] = V[x] + V[y];$$

so that the difference of two independent variables has exactly the same
variance as their sum.


Example 2 


A normal ‘fair’ 6-faced die is thrown together with a ‘fair’ tetrahedral 
(4-faced) die. What is the expected value and variance of the total score 
registered on the two dice? (For the tetrahedral die, the score registered is 
that of the face on which it lands.) 


Let the score on the 6-faced die be denoted by x;

then $E[x] = \tfrac{1}{6}(1 + 2 + 3 + 4 + 5 + 6)$
$= 3.5$

and $V[x] = \tfrac{1}{6}(1^2 + 2^2 + 3^2 + 4^2 + 5^2 + 6^2) - 3.5^2$
$= \tfrac{91}{6} - 3.5^2$
$= 15.17 - 12.25$
$= 2.92$.


Now let the score on the 4-faced die be denoted by y;

then $E[y] = \tfrac{1}{4}(1 + 2 + 3 + 4)$
$= 2.5$,
and $V[y] = \tfrac{1}{4}(1^2 + 2^2 + 3^2 + 4^2) - 2.5^2$
$= \tfrac{30}{4} - 2.5^2$
$= 7.5 - 6.25$
$= 1.25$.


Now the total score is given by x + y

and $E[x + y] = E[x] + E[y]$
$= 3.5 + 2.5$
$= 6$

and if we assume that the two scores are independent, then

$V[x + y] = V[x] + V[y]$
$= 2.92 + 1.25$
$= 4.17$.


So the expected value and variance of the total score are respectively


6 and 4.17.
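
These values can be confirmed by enumerating the joint distribution of the two dice. The sketch below is our own (the helper names `expectation` and `variance` are assumptions); it uses exact fractions and builds the distribution of x + y from p(x_i)p(y_j), which is where the assumption of independence enters.

```python
from fractions import Fraction
from itertools import product

def expectation(dist):
    """E[x] for a distribution given as {value: probability}."""
    return sum(value * prob for value, prob in dist.items())

def variance(dist):
    """V[x] = E[(x - mu)^2] for a distribution given as {value: probability}."""
    mu = expectation(dist)
    return sum((value - mu) ** 2 * prob for value, prob in dist.items())

six_faced = {face: Fraction(1, 6) for face in range(1, 7)}
four_faced = {face: Fraction(1, 4) for face in range(1, 5)}

# Distribution of the total score, assuming independence: p_ij = p(x_i) p(y_j).
total = {}
for (x, px), (y, py) in product(six_faced.items(), four_faced.items()):
    total[x + y] = total.get(x + y, 0) + px * py

print("E[x + y] =", expectation(total))
print("V[x + y] =", variance(total), "=", float(variance(total)))
print("V[x] + V[y] =", variance(six_faced) + variance(four_faced))
```

The exact variance of the total is 25/6, which rounds to the 4.17 found above.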


Example 3 


A firm which produces garden tools manufactures the blades and shafts 
(including handle) of its spades separately. The blade lengths (up to the 
point of attachment to the shaft) are approximately Normal, with mean 
30 cm and standard deviation 1 cm, and the lengths of the shafts are also 
Normally distributed with mean 78 cm and standard deviation 5 cm.
What percentage of spades, when assembled, have lengths exceeding 
110 cm? 

We will assume (without proof) that the sum of two Normally dis- 
tributed variables is itself Normal. We will denote the length of the blade 
by x, and the length of the shaft by y. Therefore x + y denotes the total 
length of the spade. 


Hence E[x + y] = E[x] + E[y] = 30 + 78 = 108
and   V[x + y] = V[x] + V[y] = 1² + 5² = 26.


So the mean total length = 108, and its standard deviation = √26 = 5.1.
Now since we have assumed that x + y is Normally distributed,


p(x + y > 110) = 1 − Φ((110 − 108)/5.1)
               = 1 − Φ(0.392)
               = 1 − 0.65 (to 2 d.p.)
               = 0.35.


Therefore 35% of the spades are longer than 110 cm. 
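The same calculation can be sketched in Python with the standard library class statistics.NormalDist; the variable names are our own, and the blade and shaft lengths are assumed independent, as in the working above.

```python
from math import sqrt
from statistics import NormalDist

# Sum of independent Normal lengths: means add, variances add.
total = NormalDist(mu=30 + 78, sigma=sqrt(1**2 + 5**2))

p_long = 1 - total.cdf(110)   # probability that a spade exceeds 110 cm
print(round(p_long, 2))
```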


Exercise 12.5a 


1 Prove that, if a is a constant,
(i) E[x + a] = E[x] + a;
(ii) V[x + a] = V[x].


2 If x and y are random variables, and a, b are constants, obtain expressions,
in terms of E[x], E[y], V[x] and V[y], for E[ax + by] and V[ax + by]:
(i) when x, y are independent;

(ii) when x, y are not independent.


3 Extend no. 2 to three mutually independent random variables x, y, z
and constants a, b, c to obtain expressions for:

(i) E[ax + by + cz];

(ii) V[ax + by + cz].


4 Find the expected total score obtained when three standard, ‘fair’ dice 
are thrown together. What is the variance of this score? 


5 Two standard, ‘fair’ dice, one blue, the other red, are thrown together,
and a total score is obtained by doubling the number shown on the blue 
die and adding to it the number on the red die. What are the expected 
value and variance of this score? Compare your results with those obtained 
in question 4. 


6 Two standard ‘fair’ dice are thrown, and their scores recorded. What
are the expected value, and standard deviation of the difference between 
the two scores? 


7 Alan and Bob play a game in which Alan uses a 6-faced die and Bob a 
4-faced die (as described in the text in example 2). To make the game 
‘fair’, they decide to double all Alan’s scores and treble Bob’s. Would you 
expect this to lead to a fair game? If not, who has the advantage? Evaluate 
the variance of each (increased) score on the single throw of a die. 


8 Prove that if x, y are independent, E[xy] = E[x] × E[y]. Hence calculate
the expected product of the scores obtained on two standard unbiased dice.


9 If x and y are independent random single digit numbers other than zero,
calculate the variance of

(i) x; (ii) x + y; (iii) 2x; (iv) 2x − y. (S.M.P.)

10 A man travels from his London office to his home by a tube journey
from station A to station B. His walking times to A and from B add up to
9 minutes with negligible variation, the variable factors in the journey
being as follows, measured in minutes:


Standard
deviation


(i) waiting for a train
(ii) train journey


Assuming that these two factors are independent and Normally distributed,
find the mean time and standard deviation of his whole journey.
Estimate the probability of the whole journey taking
(i) less than 52 minutes;
(ii) more than 65 minutes;
(iii) between 57 and 62 minutes. (M.E.I.)


11 Three independent components are placed in an electrical circuit in
such a way that, as soon as the first fails, the second (reserve) component is
automatically switched in, and then when the second component itself
fails, the third is automatically switched in. The circuit functions as long
as one of the components works. If the lifetimes of the components are
Normally distributed with mean 2 000 hours and standard deviation 300
hours, what is the probability that:

(i) the circuit will function for more than 6 500 hours;

(ii) the circuit will break down within 5 000 hours?


12 If x₁, x₂, ..., xₙ are independently distributed variables with means
μ₁, μ₂, ..., μₙ and standard deviations σ₁, σ₂, ..., σₙ respectively, write
down the mean and standard deviation of y = λ₁x₁ + λ₂x₂ + ... + λₙxₙ
where λ₁, λ₂, ..., λₙ are constants.

In the manufacture of certain metal tubes there are three distinct processes.
The mean times for each process and the standard deviations of
these times are given in the following table:


Standard 

deviation/s 
Process 1 6
Process 2 i 
Process 3 Ν᾽ 
During the course of its manufacture each tube is subjected to processes
1 and 3 once each, and to process 2 five times. Assuming that the times
spent at each stage are distributed independently, calculate the mean and
standard deviation of the total time taken in the manufacture of a tube.
Assuming Normal variation, estimate:

(i) the percentage of tubes which would take less than 130 seconds to
manufacture;

(ii) the percentage of tubes which would take more than 160 seconds to
manufacture. (M.E.I.)


13 In a factory, cylindrical pins are made to fit into circular holes in blocks
of metal. The diameter of a pin is a Normal variable with mean 9.80 mm
and standard deviation 0.10 mm. The diameter of a hole is a Normal
variable with mean 10.00 mm and standard deviation 0.12 mm. A pin
and hole ‘fit’ if the diameter of the hole exceeds that of the pin by not more
than 0.30 mm. Find the probability of getting a ‘fit’. (Assume difference in
diameters to be Normally distributed.) (C.)


14 Two independent random variables x₁ and x₂ have zero mean and
the same variance, σ². Given that y₁ = a₁x₁ + a₂x₂ and y₂ = b₁x₁ + b₂x₂,
where a₁, a₂, b₁, b₂ are constants, show:

(i) that V[y₁] = (a₁² + a₂²)σ²;

(ii) that the condition for Cov[y₁, y₂] = 0 is a₁b₁ + a₂b₂ = 0.

Obtain an expression for V[z] where z = λ₁y₁ + λ₂y₂ and λ₁, λ₂ are
constants.


Mean of n independent variables 


We will consider a sample of n random variables x₁, x₂, ..., xₙ, chosen
independently from some background population. (Note that we are
using the ‘names’ x₁, x₂, ..., xₙ in a somewhat different sense from that
used previously, when they stood for the different values a single variable
could take; convenience outweighs any possible ambiguity.) Then each
variable individually will have the same expected value μ and variance σ²
as the background population. Let m be the mean of the sample.


Then E[m] = E[(1/n) Σ xᵢ]
          = (1/n) Σ E[xᵢ]
          = (1/n)(nμ)
          = μ,

and  V[m] = V[(1/n) Σ xᵢ]
          = (1/n²) Σ V[xᵢ]    (since the xᵢ are independent)
          = (1/n²)(nσ²)
          = σ²/n.


So  E[m] = μ  and  V[m] = σ²/n,


and the standard deviation of the mean (also known as the standard error of
the mean) is given by σ/√n.

Hence we are absolutely justified in using m as an estimate of μ; and 
since the variance of the mean is less than that of individual readings, it 
follows, as expected, that there is less variation in the values of m than in the 
individual values of x;. 
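A small exact check of E[m] = μ and V[m] = σ²/n may be helpful. The Python sketch below is an illustration under our own assumptions: the background population is taken to be the score on a fair die, the sample size is n = 3, and all 216 equally likely samples are listed.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)
mu = Fraction(sum(faces), 6)                              # population mean
sigma2 = Fraction(sum(f * f for f in faces), 6) - mu ** 2  # population variance

n = 3
# Sample mean m for each of the 6**n equally likely samples.
means = [Fraction(sum(sample), n) for sample in product(faces, repeat=n)]

mean_of_m = sum(means) / len(means)
var_of_m = sum(m * m for m in means) / len(means) - mean_of_m ** 2

print(mean_of_m, var_of_m, sigma2 / n)
```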


Small and large samples 


Although we have been able to specify how the expected value and variance 
of any sample mean may be obtained, for a small sample there is no 
immediate way of stating its probability distribution; each must be 
analysed individually. 

For large samples however, we can use the central limit theorem, which 
tells us that any mean of a large sample has an approximate Normal dis- 
tribution, irrespective of the population from which it is drawn. 


Samples from a Normal distribution 


Since the central limit theorem tells us that the mean of a large sample is 
Normally distributed, it seems reasonable to assume (as can be proved) 
that the mean of any sample (large or small) drawn from a population which 
is itself Normally distributed, will be exactly Normal with expected value μ 
and standard deviation σ/√n.


Example 4 


A firm produces jars of jam in such a way that the net mass of jam per jar is 
known to be Normally distributed with mean 340 g and standard deviation 
12 g. The jars are packed in cartons of 16. In what percentage of the car- 
tons will the average mass of jam per jar be less than 336 g? 

For a sample of 16 jars, the mean mass m will be Normally distributed
with μ = 340 and σ = 12/√16 = 3.

Therefore p(m < 336) = Φ((336 − 340)/3)
                     = Φ(−1.33)
                     = 1 − Φ(1.33)
                     = 1 − 0.91 (to 2 d.p.)
                     = 0.09.


So the average mass of jam per jar will be less than 336 g in 9% of cartons. 
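This result can be reproduced with a short Python sketch using statistics.NormalDist from the standard library; the variable names are our own.

```python
from math import sqrt
from statistics import NormalDist

# Mean of 16 jars: same mean as one jar, standard deviation 12/sqrt(16).
carton_mean = NormalDist(mu=340, sigma=12 / sqrt(16))

p_light = carton_mean.cdf(336)   # probability the carton average is below 336 g
print(round(p_light, 2))
```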


Example 5 


A laboratory obtains a certain chemical solution from the manufacturers in
100 cm³ containers. The manufacturers state that on average the mass of
impurity in each container is 3.8 mg, and its standard deviation is 0.9 mg.
For a certain experiment the laboratory wants to ensure that there is at
most a 5% chance that the mass of impurity in each 100 cm³ of solution
it uses exceeds 4.0 mg. How many containers of solution need to be mixed
together to ensure this?

Suppose that we mix n containers of chemical. We will assume that n is 
large, so that the mean mass of impurity in the sample so obtained is 
approximately Normally distributed, with 


μ = 3.8 mg and σ = 0.9/√n mg.


We require p(m > 4.0) < 0.05
⇒ p(m < 4.0) > 0.95

⇒ Φ((4.0 − 3.8)/(0.9/√n)) > Φ(1.645)

⇒ 0.2√n/0.9 > 1.645

⇒ √n > 1.645 × 4.5 = 7.40
⇒ n > 54.8.


So at least 55 containers of the solution need to be mixed together.
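The sample-size calculation can be sketched in Python. The function name min_containers is our own, and the sketch uses the exact one-tailed 5% point of the standard Normal distribution (about 1.645) obtained from statistics.NormalDist; a larger, rounded critical value would give a correspondingly larger n.

```python
from math import ceil
from statistics import NormalDist

def min_containers(sigma, margin, confidence):
    """Smallest n with p(sample mean < mu + margin) >= confidence,
    assuming the sample mean is Normal with standard deviation sigma/sqrt(n)."""
    z = NormalDist().inv_cdf(confidence)   # one-tailed critical value
    return ceil((z * sigma / margin) ** 2)

n_min = min_containers(sigma=0.9, margin=4.0 - 3.8, confidence=0.95)
print(n_min)
```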


Exercise 12.5b 


1 Assuming that the heights of male college students are Normally distributed
with mean 173 cm and standard deviation 8 cm, what is the
probability that the mean height of a sample of 40 students will

(i) exceed 175 cm;

(ii) lie between 172 and 174 cm?


2 The marks obtained in a mathematics examination are approximately
Normally distributed with mean 54 and standard deviation 13. What is
the probability that the average mark for a group of 10 students
(i) exceeds 60;
(ii) is less than 50?


3 A certain industrial process takes a worker an average of 24 minutes to
complete, and the standard deviation of the completion time is 6 minutes. 
If the process is repeated 100 times, use the central limit theorem to find 
approximately the probability that the mean process time exceeds 25 
minutes. 


4 Each schoolboy in a class of 36 spins an unbiased coin 100 times. What
is the probability that the average number of heads obtained per boy

(i) is less than 49;

(ii) is greater than 52?

(A continuity correction is not necessary in this case.)

5 Capsules of a certain drug have a nominal net mass of 10 g. They are
packed in batches of 100 capsules. A machine fills the capsules with the
drug in such a way that the masses are Normally distributed with mean net
mass 10.06 g and standard deviation 0.2 g. Find the probability that:

(i) a batch does not contain any ‘underweight’ capsules;

(ii) the mean net mass of the capsules in a batch does not fall below the
nominal net mass.


6 A coal merchant sells his coal in bags marked ‘50 kg’. In fact he claims
that the average mass is 50 kg, with a standard deviation of 1 kg. A suspicious
trade inspector has 60 of the bags weighed, and finds that their mean
mass is 49.6 kg. What is the probability that such a result, or a more extreme
one, could have arisen if the coal merchant’s claim is true? Do you think
that the inspector’s suspicions were justified?


7 Electric light bulbs are produced with a mean life-time of 1 600 hours 
and a standard deviation of 250 hours. Calculate the minimum sample 
size necessary to ensure that there is at most a 1% chance of the average 
life-time of such a sample being less than 1 550 hours. 

(Assume that the sample will be large enough for the mean to have an 
approximately Normal distribution.) 


8 How large a sample would you take in order to estimate the mean of a 
population, so that the probability is 0.95 that the sample mean will not 
differ from the true mean of the population by more than 0.2 of the standard 
deviation of the population? 


Unbiased estimates 


If T is some statistical estimate calculated from data, and θ a probability
parameter, then if E[T] = θ, T is said to be an unbiased estimate of θ.

It should be noted, however, that an unbiased estimate is not necessarily
a particularly good estimate: its variance, for instance, may be quite
large. It is simply the best we have.
It therefore follows, since E[m] = μ, that the sample mean m is an
unbiased estimate of μ, the mean, or expected value, of the parent population.

Since m is an unbiased estimate of μ, it would seem to follow that s²
should be an unbiased estimate of σ²; after all, both are referred to as
variances, and as we saw in 12.1, their forms are very similar. To pursue
this further, we note that s² is itself a random variable, depending only on
the sample from which it is calculated, and we can therefore consider its
expected value.


Now s² = (1/n) Σ (xᵢ − m)²,


and in this form it is a little difficult to manage, since we cannot say
anything directly about E[(xᵢ − m)²]. However, we do know that

E[(xᵢ − μ)²] = σ²  and  E[(m − μ)²] = V[m] = σ²/n.

Also, recalling that the sample values xᵢ have mean m and variance s², it
follows that the values xᵢ − μ have mean m − μ and variance s². Since a
variance is the mean of the squares less the square of the mean,


s² = (1/n) Σ (xᵢ − μ)² − (m − μ)².


So E[s²] = (1/n) E[Σ (xᵢ − μ)²] − E[(m − μ)²]
         = σ² − V[m]
         = σ² − σ²/n
         = ((n − 1)/n) σ².
n 


So s² is not an unbiased estimate of σ², and in fact s² will tend to give a
low value for σ² (as can readily be seen by considering the zero value of s²
that would arise if the sample had only one member).
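The bias of s² can be seen exactly in a small case. The Python sketch below is an illustration under our own assumptions: the population is the score on a fair die, the sample size is n = 3, and s² is computed with divisor n for each of the 216 equally likely samples; the function name sample_variance is our own.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)
mu = Fraction(sum(faces), 6)
sigma2 = Fraction(sum(f * f for f in faces), 6) - mu ** 2   # population variance

def sample_variance(sample):
    """s^2 with divisor n, taken about the sample mean m."""
    n = len(sample)
    m = Fraction(sum(sample), n)
    return sum((x - m) ** 2 for x in sample) / n

n = 3
samples = list(product(faces, repeat=n))
expected_s2 = sum(sample_variance(s) for s in samples) / len(samples)

print(expected_s2, sigma2)
```

The average of s² over all samples falls short of σ² by exactly the factor (n − 1)/n, so multiplying s² by n/(n − 1) removes the bias.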


Therefore E[(n/(n − 1)) s²] = (n/(n − 1)) × ((n − 1)/n) σ² = σ²,


so that an unbiased estimate of σ² is n s²/(n − 1), and the corresponding
‘unbiased’ estimate of σ is s √(n/(n − 1)).
Exercise 12.5c


1 The lengths, in millimetres, of eight full-grown fish of a certain species 
were found to be : 56, 49, 68, 58, 63, 60, 55, 59. Obtain unbiased estimates for 
the mean and variance of the length of the species. 


2 A sample of 15 leaves was obtained from a certain plant, and the areas
of these leaves, in square centimetres, are given below: 


4.60, 5.85, 5.13, 3.94, 5.61, 4.08, 4.71, 4.29, 6.01, 5.46, 5.80, 6.33, 4.43, 4.90, 
5.09. 


Obtain unbiased estimates of the mean and variance of leaf area (correct 
to 2 decimal places). 


3 In a sample of bars of chocolate produced in a particular factory, the
mean mass was found to be 101.3 g and the standard deviation 1.6 g.
Obtain an unbiased estimate (correct to 2 decimal places) of the standard
deviation of all bars of chocolate produced in the factory if the sample size
is

(i) 5; (ii) 20; (iii) 100.


4 t is an unbiased estimator for a parameter θ, and tᵢ, i = 1, 2, ..., n, are
values of t obtained from n experiments.

(i) Prove that

(1/n) Σ tᵢ  (summed over i = 1, ..., n)  is an unbiased estimate of θ.

(ii) Find the relationship between the numbers λᵢ (i = 1, ..., n) if Σ λᵢtᵢ
is an unbiased estimate of θ.


5 A random variable x has expectation μ and variance σ². Find the
variance of

x̄ = (1/n) Σ xᵢ

and the expected value of  s² = (1/(n − 1)) Σ (xᵢ − x̄)²,

where x₁, x₂, ..., xₙ are n independent observations from the distribution
of x.
A further set of m independent observations y₁, y₂, ..., yₘ is taken
from the same distribution and

t² = (1/(m − 1)) Σ (yⱼ − ȳ)²  (summed over j = 1, ..., m).

If u² = λs² + (1 − λ)t², (0 < λ < 1), show that s², t² and u² have the
same expectation and find λ such that V[u²] is a minimum.


(Take Tee (M.E.I.) 


ey 


*12.6 Significance and confidence 


In a television advertisement, out of a sample of eight housewives interviewed
leaving a supermarket, six were found to prefer the new washing
powder, ‘Squelch’. Is this convincing evidence that more housewives use
Squelch than all other powders?

This question involves firstly practical and experimental considerations, 
and secondly mathematical considerations. The first thing we must know 
is how the eight housewives were chosen. Were they chosen randomly? 
(For the present we must be content with an intuitive notion of random- 
ness.) Or did the interviewer particularly look out for shopping bags con- 
taining packets of Squelch? Such questions must be asked before any 
meaningful mathematical deductions can be made. However since we are 
primarily concerned with the mathematics of such situations, we will 
assume that these practical questions have been settled, and for the pur- 
poses of this example we will assume that the eight housewives were chosen 
randomly. 

We shall not (and indeed we cannot) answer the question about prefer- 
ence for Squelch directly. Instead we invert the question and ask: 

‘Assuming that exactly 50% of housewives prefer Squelch to other 
powders, what is the probability that this result (six out of eight choices 
for Squelch), or a more extreme one, will occur by chance?’ 

(The phrase ‘this result, or a more extreme one’, though it seems to complicate
the problem, does in fact provide us with a standard method for dealing
with such problems, and the necessity of its inclusion will become
increasingly obvious.)

If this probability is large, then there is no reason to doubt the assumption, 
but if the probability is small then there is considerable evidence against 
our assumption; we become sceptical about the assumption that house- 
wives are indifferent, and increasingly certain that they do indeed prefer 
Squelch to other powders. 

The phrase ‘it is not true that more housewives prefer Squelch to any
other powder’ could be interpreted mathematically in a number of ways.
We will consider the most extreme case for which the statement holds, when
exactly half prefer Squelch, so that the statement is only just true. (Apart
from any other considerations of course, this will simplify the mathematics.)
We therefore have a binomial situation, with ‘success’ being regarded as
finding a housewife who prefers Squelch.


So n = 8, p = 1/2, and the probability distribution is:

[Table: binomial probability distribution for n = 8, p = 1/2.]


p(≥ 6 ‘successes’) = (8 choose 6) × (1/2)⁸ + (8 choose 7) × (1/2)⁸ + (8 choose 8) × (1/2)⁸

                   = 28 × 1/2⁸ + 8 × 1/2⁸ + 1 × 1/2⁸

                   = 37/256

                   = 0.145 ≈ 1/7.


Therefore there is a probability of 1/7 that at least six out of a sample of eight
housewives would choose Squelch even when there is no overall preference for
the powder. So we should regard the evidence of the advertisement as not
particularly convincing.
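The tail probability used here is easily computed exactly. In the Python sketch below the function name upper_tail is our own; it sums the binomial probabilities of the observed number of ‘successes’ and of every more extreme number.

```python
from fractions import Fraction
from math import comb

def upper_tail(n, k, p):
    """p(at least k successes in n independent trials, each with probability p)."""
    return sum(comb(n, r) * p**r * (1 - p)**(n - r) for r in range(k, n + 1))

p_value = upper_tail(8, 6, Fraction(1, 2))
print(p_value, float(p_value))
```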

This analysis is one example of a very simple significance test. The probability
1/7 is a measure of the significance of the experimental result on the
assumption of no overall preference. More generally, significance testing
requires the use of two basic concepts: null hypothesis and level of significance.


Null hypothesis 


This is the basic hypothesis upon which we build a probability model. It 
should be a precise statement, which lends itself to mathematical formula- 
tion. In our example the null hypothesis was that exactly half of housewives 
prefer Squelch. The term ‘null’ arises primarily from contexts such as 
medical research, where new treatments or drugs are tested for effective- 
ness, the null hypothesis being that such a treatment has no effect; and 
generally speaking a null hypothesis takes the form of just such a negative 
statement. 


Levels of significance 


Using the null hypothesis, we evaluate the probability of obtaining such an
extreme value as our sample statistic: the smaller this probability, the
greater is the evidence against the null hypothesis. We normally use three
standard levels of significance: 5%, 1%, and 0.1%, corresponding to
probabilities of 0.05, 0.01, and 0.001 respectively. If, for instance, we
obtain a probability of 0.03, we say that this result is significant at the 5%
level (but not at the 1% level), and we can therefore reject the null hypothesis
at the 5% level. Which significance level is used generally depends
on the nature of the experiment or research.


Errors 


There are two distinct types of error which may arise from significance
testing:
1 A sample may not provide significant evidence against the null hypothesis,
and so the hypothesis may be accepted when in fact it is incorrect.
2 If we reject the null hypothesis at, say, the 5% level, then even when the
hypothesis is correct about 5 samples out of 100 will give such an extreme
result purely by chance, and on those occasions the hypothesis is wrongly
rejected. The incidence of this type of error can of course be reduced by
using a more stringent level of significance, for example the 1% level.

But further detailed analysis of such errors is beyond the scope of this
text, and for the present we must just accept that such unavoidable errors
may arise.


The following two examples are typical significance tests. 


Example 1 


Before the introduction of a new drug, the success rate in treatment of 
patients for a certain disease was 43%. However, of 88 patients treated with 
the drug, 51 recovered. On the basis of this evidence, is the drug signifi- 
cantly effective in treating the disease? 


Null hypothesis: That the drug has no effect, i.e., the probability of a patient 
recovering is still 0.43. 


We therefore have a binomial situation, with n = 88, p = 0.43.

Since n is large we can approximate by a Normal distribution with
μ = 88 × 0.43 = 37.8

and σ = √(88 × 0.43 × 0.57) = 4.64.


Therefore p(51 or more recoveries) = 1 − p(≤ 50 recoveries)

                                   = 1 − Φ((50.5 − 37.8)/4.64)
                                     (using a continuity correction)
                                   = 1 − Φ(2.74)
                                   = 1 − 0.997
                                   = 0.003.
As 0.003 < 0.01, this result is significant at the 1% level. So the probability,
on the basis of the null hypothesis, of obtaining the observed result
is very remote, and we can say with considerable confidence that the drug
is effective.
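The Normal approximation can be compared with the exact binomial tail. The Python sketch below is illustrative only (the variable names are our own); it applies the same continuity correction as above and then sums the binomial probabilities directly.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p, observed = 88, 0.43, 51
mu, sigma = n * p, sqrt(n * p * (1 - p))

# Normal approximation with continuity correction: p(X >= 51) ~ 1 - Phi((50.5 - mu)/sigma).
p_normal = 1 - NormalDist(mu, sigma).cdf(observed - 0.5)

# Exact binomial tail, summed term by term.
p_exact = sum(comb(n, r) * p**r * (1 - p)**(n - r) for r in range(observed, n + 1))

print(round(p_normal, 4), round(p_exact, 4))
```

Both values lie below 0.01, so the conclusion at the 1% level does not depend on the approximation.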


Example 2 


The machine described in example 4 of section 12.5, filling jars with jam
such that the mass of jam is Normally distributed with μ = 340 g and
σ = 12 g, breaks down. After it is repaired, it is found that the average
mass of jam in the first 100 jars it fills is 342 g. Has the performance of the
machine significantly altered?


Null hypothesis: The machine is unchanged, i.e., the means of samples, 
size 100, are Normally distributed, with mean 340 and standard deviation 
12/,/100 = 1.2. 

We need to find the probability of a result of 342, or one more extreme, 
occurring by chance. In this case, however, a more extreme result could 
reasonably be one less than 338, since we are not really concerned whether 
the mass of jam is increased or decreased, but simply whether it has changed, 
and the relevant probability is indicated by the shaded areas in the diagram. 


For fairly obvious reasons, therefore, this is referred to as a two-tailed (or 
two-sided) test, as opposed to the previous example which was one-tailed. 


Now p(mean < 338 or mean > 342) = 2[1 − Φ((342 − 340)/1.2)]
                                = 2[1 − Φ(1.67)]
                                = 2[1 − 0.952]
                                = 0.096.


This is not significant at the 5% level, and so we can say that this evidence 
does not indicate any significant alteration in the machine’s performance. 
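A two-tailed probability of this kind can be sketched in Python as follows. The variable names are our own; the null distribution is that of the mean of 100 jars, and the two equal tails beyond 338 and 342 are added by doubling one of them.

```python
from math import sqrt
from statistics import NormalDist

# Null hypothesis: sample means of 100 jars are Normal(340, 12/sqrt(100)).
null = NormalDist(mu=340, sigma=12 / sqrt(100))
observed = 342

deviation = abs(observed - null.mean)
# Two tails: results at least as far from 340 as the observed mean, in either direction.
p_two_tailed = 2 * (1 - null.cdf(null.mean + deviation))

print(round(p_two_tailed, 3), p_two_tailed < 0.05)
```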
Exercise 12.6a


1 A coin is tossed eight times, and a head is obtained each time. Test, at 
the 1% level, the hypothesis that the coin is unbiased. (Use a two-tailed 
test.) 


2 An amateur magician claims that he can read people’s minds. To prove
this he tells a person to think of one of the colours blue, yellow or red, and 
he then predicts what that colour is. In ten such mind-reading sessions he is 
correct seven times. Is this sufficient evidence to accept his claim? 


3 Test, at the 5% level, the hypothesis that the variable x is Normally
distributed with mean 80 and standard deviation 10, if we obtain:

(i) a value of x equal to 100;

(ii) a value of x equal to 65;

(iii) three values of x, all less than 70;

(iv) three values of x, whose average is equal to 70.


4 Over a number of years an average of 40 out of 100 patients who underwent
a difficult operation survived. Last year new medical techniques were
introduced and 73 out of 150 patients survived the operation. Explain
whether or not the maintaining of these techniques is statistically justified.

(S.M.P.)


5 In a certain constituency, in the last election, ἃ of the electorate voted
for the Nationalist Party. In a sample of 200 voters it is now found that 107 
support the Nationalists. Test the hypothesis that the support for the 
Nationalist Party has not decreased since the last election. 


6 A tyre company claims that the lifetimes of its tyres are Normally distributed
with mean 34 000 km and standard deviation 6 000 km:

(i) A motorist finds that one of their tyres lasts only 26 000 km. Does this
lead you to doubt the manufacturers’ claims?

(ii) A car-hire firm using the tyres finds that the average lifetime of 100
such tyres is 32 200 km. Does this evidence lead you to doubt the
manufacturers’ claims?


7 The masses of a certain species of rabbit are known to be Normally distributed
with mean 1.68 kg and standard deviation 0.24 kg. Nine rabbits
are fed on specially enriched foodstuffs, and their average mass after two
months is found to be 1.85 kg. Test at

(i) the 5% level,

(ii) the 1% level, the hypothesis that the foodstuff does not increase the
rabbits’ masses.


8 The following experiment was carried out 100 times by each of 16 pairs 
of children in a class. ‘Each pair has an ordinary pack of cards from which 
one child selects a card and replaces it; the other child does likewise, and 
they score a point if the suit is the same for both’. The average number of 
points scored per pair was 27.2. Test at the 5% significance level, the 
hypothesis that all the cards were drawn at random. (S.M.P.) 
9 A random sample of 200 persons from a large population is examined
for eye colour. In this sample 50 persons are found to have blue eyes. Is this
result consistent with the hypothesis that the proportion of blue-eyed persons
in this population is 0.2? What would have been your conclusion if
there had been 200 blue-eyed persons in a random sample of 800 persons?

(M.E.I.)


10 A grower sows 200 Carlton lettuce seeds under conditions such that
the average germination rate is 75%. By using a suitable approximation,
find

(i) the probability that more than 170 seeds germinate,

(ii) the probability that fewer than 140 seeds germinate.

The grower also sows 120 Alberni lettuce seeds under the same conditions
as the Carlton seeds, and finds that 82 seeds germinate. Test whether the
Alberni seeds have a germination rate less than 75%. (α.)


11 The protein intakes for a sample of six unemployed men were observed
to be 86, 82, 66, 66, 94, 104 g day⁻¹. A large number of employed men
had an average intake of 97.6 g day⁻¹ and the variance of their intake
was 225.
Do these data support the view that at the time of the study unemployed
men were underfed by comparison with men in employment? (M.E.I.)


12 You are engaged as an expert witness for the prosecution in a court
case in which a gaming club is accused of running an unfair roulette wheel.
The evidence is that out of 3 700 trial spins, zero (on which the club wins)
turned up 140 times. There are 37 possible scores on a trial spin, labelled
0 to 36, and these should have equal probability. Test whether there is
evidence that the wheel is biased. (ο.)


13 According to a certain hypothesis the variable x is uniformly distributed
in (0, 1). Determine the probability that the smallest member of a
sample of n observations of x has value not less than X where 0 < X < 1.
Ten observations were taken of x; the values of the smallest and largest
were 0.2 and 0.7. Calculate the probability that ten observations:
(i) are each not less than 0.2;
(ii) are each not greater than 0.7;
(iii) lie within an interval of length 0.5.

State whether your results would lead you to reject the hypothesis. (α)


14 In a sample of 1 000 voters, an opinion pollster finds that 520 intend
to vote for Jones and 480 for Smith in the forthcoming election. What is the
probability of a result such as this, or one even more favourable for Jones, if
in fact Jones has only the support of 50% of the electorate? How many
people in the sample would need to support Jones to make it significant
(i) at the 5% level,

(ii) at the 1% level,

that he has more support than Smith?
15 A children’s test which had been standardised some years ago to have
a mean of 100 and a standard deviation of 15 was applied recently to a
random group of children, and their mean score was recorded as 101.6, but
no record was made of the number of children tested. What is the least
number of children tested if the result shows that the population mean is
unlikely still to be 100? (You should assume that the mean value will certainly
not have decreased. Work at the 1% significance level.) (S.M.P.)


Confidence intervals 


The links in lengths of chain produced in a factory are known to have
breaking tensions which are Normally distributed with standard deviation
0.24 kN. Eight links are tested to destruction, and are found to have breaking
tensions of 3.61, 3.58, 3.66, 3.65, 3.89, 3.48, 3.50, 3.59 kN. What can
we say, if anything, about the average breaking tension of all links produced
in the factory?

Let us denote the average breaking tension (of the background population)
by μ.

Now the sample mean m (which for this sample is 3.62) is distributed
Normally, with the same μ as the background population, and σ = 0.24/√8
= 0.085.

We know that (m − μ)/σ has a standard Normal distribution, and from
the properties of the standard Normal curve, it is 95% certain that
(m − μ)/σ lies between −1.96 and 1.96.


[Diagram: standard Normal curve, with the central 95% of the area lying between −1.96 and 1.96.]


So p(−1.96 < (m − μ)/σ < 1.96) = 0.95
⇒ p(−1.96σ < m − μ < 1.96σ) = 0.95.
Rewriting the orderings,


p(m − 1.96σ < μ < m + 1.96σ) = 0.95.


Therefore for the chain links, it is 95% certain that


3.62 − 1.96 × 0.085 < μ < 3.62 + 1.96 × 0.085
⇒ 3.62 − 0.17 < μ < 3.62 + 0.17
⇒ 3.45 < μ < 3.79.


The interval 3.45 to 3.79 is referred to as the 95% confidence interval for μ,
and 3.45 and 3.79 are the 95% confidence limits.
Similarly the 99% confidence limits for μ are


3.62 ± 2.58 × 0.085
= 3.62 ± 0.22
(i.e., 3.40 and 3.84).
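The confidence limits for the chain links can be reproduced with the following Python sketch. The function name confidence_limits is our own; it assumes, as in the text, a Normal population with known standard deviation, and it takes the critical value from statistics.NormalDist rather than from tables.

```python
from math import sqrt
from statistics import NormalDist, fmean

tensions = [3.61, 3.58, 3.66, 3.65, 3.89, 3.48, 3.50, 3.59]   # kN
sigma = 0.24                                                   # known population s.d., kN

def confidence_limits(data, sigma, confidence):
    """Limits m -/+ c * sigma / sqrt(n), where Phi(c) = (1 + confidence) / 2."""
    m = fmean(data)
    c = NormalDist().inv_cdf((1 + confidence) / 2)
    half_width = c * sigma / sqrt(len(data))
    return m - half_width, m + half_width

limits_95 = confidence_limits(tensions, sigma, 0.95)
limits_99 = confidence_limits(tensions, sigma, 0.99)
print([round(v, 2) for v in limits_95], [round(v, 2) for v in limits_99])
```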


The 95% and 99% confidence intervals are the standard intervals used, 
but there is no reason why others should not be employed. Quite naturally, 
the more certain we need to be of the interval within which μ lies, the 
larger that interval becomes. So for any experimental results, we have to 
balance the usefulness of the result with the confidence we can put in it, in 
choosing a suitable interval. Also, of course, the larger the sample used, 
the narrower will be the confidence interval so obtained. 

The general procedure for establishing confidence limits for μ (assuming
a Normal distribution, with known standard deviation σ) is as follows:


(i) Obtain a representative value (this may be a single reading or, better,
the mean of a sample) and determine its standard deviation.
(ii) Decide on a confidence coefficient: 0.95, for example, is the confidence
coefficient corresponding to a 95% confidence interval. Denote this
coefficient by 1 − 2α (so for a 0.95 coefficient, α = 0.025).

(iii) Define c_α by


Φ(−c_α) = α
⇒ Φ(c_α) = 1 − α,


therefore the unshaded area is 1 − 2α, and the shaded areas are each α.


(iv) Then, in the case of the sample mean,


p(m − (σ/√n) c_α < μ < m + (σ/√n) c_α) = 1 − 2α

and the confidence limits are m ± σ c_α/√n.


Similarly, for a single observation x, the confidence limits for μ are x ± σ c_α.
However the confidence intervals obtained from single observations are
usually so wide as to be of little real use.

This technique of obtaining confidence intervals is very useful. But it
does have limitations, the most obvious of which is that the standard
deviation of the population has to be known; in practice, however, σ is
often not known, and has to be estimated from the sample. A more complicated
technique has then to be employed (using what is known as the ‘t’
distribution), and the effect of using an estimated value of σ is to widen
the equivalent confidence intervals for a known value of σ. However, for
large samples the process described above can give good approximations
to the confidence limits even when the standard deviation has to be estimated.
(See exercise 12.6b, no. 7.)


Exercise 12.6b 


1 It is known that the variable x is Normally distributed with standard
deviation 8. The average size of a sample of 10 values of x is found to be 63.
Find (i) 90%, (ii) 95%, and (iii) 99% confidence limits for the mean value
of x.


2 The melting points, in °C, of 10 samples of a certain metal were found
to be 1154, 1151, 1154, 1150, 1148, 1152, 1155, 1153, 1149, 1154.
Past experience indicates that these observations will be Normally
distributed, with standard deviation equal to 3 °C. Find (i) 95%, and (ii) 99%
confidence limits for the mean melting point of the metal.


3 It is known that the heights of 14-year-old schoolboys are Normally
distributed, with a standard deviation of 9.2 cm. The average height of
16 such schoolboys, chosen at random in a certain school, is found to be
159.1 cm. Find (i) 95%, and (ii) 99% confidence limits for the average
height of 14-year-old schoolboys.


4 The average length of 200 nails produced by a certain machine was
found to be 7.42 cm. If the standard deviation of nail lengths is known to
be 0.53 cm, find (i) 95% and (ii) 99% confidence limits for the mean length
of all nails produced by the machine.


5 It is known that an examination paper is marked in such a way that the
standard deviation of the marks is 15.1. In a certain school, 80 candidates
take the examination, and they have an average mark of 57.4. Find
(i) 95% and (ii) 99% confidence limits for the mean mark in the
examination.


6 A factory manufacturing ammeters tests them for zero errors in their
calibration. From past routine tests, it is known that the standard
deviation of these errors is 0.3.

A batch of 9 ammeters, taken from one worker's production, has zero
errors of 1.0, −0.1, −0.3, 1.6, 0.5, 0.4, 0.5, 0.2, −0.2. Test whether there
is evidence of bias in the ammeters produced by this worker, and establish a
95% confidence interval for the mean zero error of his ammeters. (O.C.)


7 A random sample of 100 capacitors, each of nominal capacitance
2 μF, was taken from a very large batch. The capacitance of each capacitor
in the sample was measured. The results of these measurements are
summarised in the following table:


Capacitance/μF
(mid-interval value)   1.85  1.90  1.95  2.00  2.05  2.10  2.15

No. of capacitors         2    12    23    31    20    10     2


Calculate the mean and the standard deviation of these capacitances.
Calculate also the standard error of the mean, and use it to determine 95%
confidence limits for the mean capacitance of capacitors in the batch. (Use
a Normal approximation.) (M.E.I.)


8 In a random sample of 1 000 housewives from a large population 300
stated that they used a certain detergent. Show that 95% confidence
limits for the proportion of the population using this detergent are
(approximately) 0.272 to 0.328. (Use a Normal approximation to the binomial
distribution, and an estimated value of σ.)

Following an advertising campaign a second sample of 800 housewives
was taken, and of these 260 stated that they used the detergent. Is this
evidence of the success of the campaign?

Subsequently it was decided to give a free gift with each packet of the
detergent, and later in a random sample of 600 housewives 216 stated that
they used the detergent. Does this indicate the success of the free gift
scheme? (M.E.I.)


9 The probability density function of a variable x takes the form shown
in the diagram, the constant a being unknown. A single observation $x_1$ is
made of x. Give an unbiased estimate of a, and calculate 95% confidence
limits for a. (Use first principles.)


[Diagram: the probability density function of x.]


After a total of 24 observations have been made, the mean $\bar{x}$ is calculated.
Use this mean to obtain an unbiased estimate of a, and calculate
approximate 95% confidence limits for a. (C.)


*12.7 Correlation and regression
Bivariate data


The marks of a class of 20 pupils in a mathematics examination, and a 
French examination are given in the table below. 


Mathematics   10  13  17  19  21  22  22  22  23  24
French        10  16  11  16  17  17  18  20  22  18


Mathematics   24  26  27  28  31  31  31  34  35  37
French        25  20  21  18  18  24  26  29  22  25


Since the marks are paired for each pupil, such data is referred to as 
bivariate data (as opposed to univariate data, where each item is treated 
individually; all the data so far dealt with in this chapter has been uni- 
variate, though we rarely use this term). It is generally convenient to 
represent bivariate data in the form of ordered pairs. Thus the data above 
would be (10, 10), (13, 16),.... 


Scatter diagrams 


To display bivariate data pictorially, we use a scatter diagram. This is 
obtained by regarding the ordered pairs of bivariate data as Cartesian 
coordinates, and marking the associated points on a graph: 


[Scatter diagram: French mark (y) plotted against mathematics mark (x).]


The points in the diagram all lie within a relatively narrow band, and this 
is emphasised by the boundary which has been superimposed around the 
points. This suggests some association between the x and y coordinates 
(i.e., between the mathematics and French marks). Such association is
generally referred to as correlation.

The scatter diagram illustrates bivariate data which is closely correlated
(also referred to as positive or direct correlation), i.e., small values
of x correspond to small values of y and large x to large y. An example of
such data might be the number of pupils and number of staff in secondary
schools.


The second diagram shows data for which there is little or no correlation,
for example the number of pupils, and number of trees planted in the
grounds of a group of schools.


The third diagram illustrates negative (or inverse) correlation; the data
here for example might be the numbers of pupils in schools, and the
average percentages of the school's pupils per class.


Regression 


A precise measure of correlation follows naturally from investigation of the 
related topic of regression. Whilst in the study of correlation both variates 
are regarded as having equal status, in regression one (say x, though either 
may be chosen) is regarded as the independent variate, and the dependence 
of the other variate, y, on x is investigated. To be precise, we need to find a
relationship between each fixed value of x, and the expected (or mean)
value of y for that particular x. Let us illustrate this by a theoretical example:


Two dice, one red and the other blue, are thrown together; let x denote
the score on the red die and y the total score on the two dice. Then the
possible (equally likely) values of y for each value of x are shown in the
diagram:


[Diagram: the possible values of y plotted against x.]


Now when x = 1, the expected value of y is equal to

$$\tfrac{1}{6}(2 + 3 + 4 + 5 + 6 + 7) = 4\tfrac{1}{2}.$$


This point is marked on the diagram by a cross, and equivalent points for 
the other values of x are similarly marked. The straight line upon which all 
these points lie is then referred to as the regression line of y on x, and its 
equation in this case is y = x + 3.5. 

Similarly in the diagram below we have the regression line of x on y,
which we see has equation y = 2x.


[Diagram: the regression line of x on y.]


In this case y has been regarded as the independent variable, and for a 
particular total score, we are considering the expected value of the score 
on the red die. 
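
Both regression lines of the two-dice example can be checked by listing the 36 equally likely outcomes. The following Python sketch is an illustration only; the helper name `expected` and the dictionaries are our own, and exact fractions are used so that the comparisons involve no rounding.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes: x = red score, y = total score.
outcomes = [(red, red + blue) for red, blue in product(range(1, 7), repeat=2)]


def expected(values):
    """Mean of a collection of equally likely values, kept exact."""
    values = list(values)
    return Fraction(sum(values), len(values))


# Regression of y on x: expected total for each fixed red score.
y_given_x = {x: expected(y for xx, y in outcomes if xx == x) for x in range(1, 7)}
# Regression of x on y: expected red score for each fixed total.
x_given_y = {y: expected(x for x, yy in outcomes if yy == y) for y in range(2, 13)}

print(y_given_x[1])                                                  # 9/2
print(all(y_given_x[x] == x + Fraction(7, 2) for x in y_given_x))    # y = x + 3.5
print(all(2 * x_given_y[y] == y for y in x_given_y))                 # y = 2x
```

The two lines differ because each answers a different question: the first fixes x and averages y, the second fixes y and averages x.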
This example, of course, is an idealised probability model. In general,
the relationship between x and the equivalent expected value of y will not 
necessarily be linear. But whatever the relationship, linear or otherwise, we 
shall restrict our task (at this stage) to finding a straight line approximation. 
Such an approximation is generally referred to as the line of regression of 
y on x, though it must be emphasised that such a line, derived from a 
bivariate sample, will only provide an approximation to the true regression 
relationship in the background population from which the sample is 
drawn. 

There is of course no single criterion for determining the ‘best’ line of 
regression for a particular set of data, but in practice the most useful one is 
known as the least squares criterion, and the line so obtained is called the 
least squares line of regression of y on x: 


If $d_i$ is the distance of the point $P_i$ from the line l in the y-direction, and l is
chosen in such a way that $\sum d_i^2$ is a minimum, then l is the least squares
line of regression of y on x.


If instead of the line of regression of y on x, we need that of x on y, the
least squares criterion is still used, but the deviations from the regression
line are in this case measured in the x-direction:


[Diagram: deviations from the line measured in the x-direction.]


The obvious question arises: why should we choose this particular
criterion? Why not the more obvious one of minimising the sum of
perpendicular distances from the line? Our original definition of the
regression relationship throws some light on the choice of deviation parallel to
the axes, but the main justification for the criterion is on more advanced
mathematical grounds. On a practical level, however, the algebra
following from the least squares criterion is simpler than that arising, for example,
from considerations of perpendicular distance, and the very neat and
convenient results are in themselves some justification for the method.


Derivation of the line of regression of y on x


Let the points $P_1, P_2, \ldots, P_i, \ldots, P_n$ of the scatter diagram have
coordinates $(x_1, y_1), (x_2, y_2), \ldots, (x_i, y_i), \ldots, (x_n, y_n)$. Now if we take the
equation of the regression line to be y = a + bx, then

$$\text{deviation } d_i = y_i - (a + b x_i).$$

We therefore need to find values of a, b such that

$$S = \sum_i (y_i - a - b x_i)^2 \text{ is a minimum.}$$


We minimise S by regarding it as a function of a, b and then equating the
partial derivatives of S with respect to a, b (written ∂S/∂a, ∂S/∂b) to zero.
(∂S/∂a denotes the derivative of S with respect to a, while regarding b as a
constant. Similarly ∂S/∂b is the derivative of S with respect to b, keeping a
constant. A thorough treatment of partial differentiation can be found in
most books on advanced calculus, but a detailed knowledge of the subject
is not essential for the purposes of this section.)

Now
$$\frac{\partial S}{\partial a} = -2 \sum (y_i - a - b x_i)$$
and
$$\frac{\partial S}{\partial b} = -2 \sum x_i (y_i - a - b x_i).$$


So if ∂S/∂a = 0 and ∂S/∂b = 0, we obtain the equations:

$$\sum (y_i - a - b x_i) = 0 \;\Rightarrow\; \sum y_i - na - b \sum x_i = 0 \qquad (1)$$
$$\sum x_i (y_i - a - b x_i) = 0 \;\Rightarrow\; \sum x_i y_i - a \sum x_i - b \sum x_i^2 = 0 \qquad (2)$$

and in their most convenient form, these equations can be written:

$$\sum y_i = na + b \sum x_i \qquad (1)$$
$$\sum x_i y_i = a \sum x_i + b \sum x_i^2. \qquad (2)$$

Dividing equation (1) by n, we obtain

$$\frac{1}{n} \sum y_i = a + b\,\frac{1}{n} \sum x_i$$

and writing the means of x and y as $\bar{x}$ and $\bar{y}$, the equation becomes

$$\bar{y} = a + b\bar{x}.$$


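This tells us that the regression line passes through the 'mean point',
$(\bar{x}, \bar{y})$, so that the equation of the regression line can be rewritten as

$$y - \bar{y} = b(x - \bar{x}).$$

Hence we need only find the value of b from equations (1) and (2). This is
done by multiplying equation (1) by $\sum x_i$ and equation (2) by n, and then
subtracting (1) from (2). This gives:

$$n \sum x_i y_i - \left(\sum x_i\right)\left(\sum y_i\right) = b\left[n \sum x_i^2 - \left(\sum x_i\right)^2\right]$$

and dividing through by $n^2$,

$$\frac{\sum x_i y_i}{n} - \left(\frac{\sum x_i}{n}\right)\left(\frac{\sum y_i}{n}\right) = b\left[\frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2\right].$$

Now
$$\frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2 = s_x^2, \text{ the variance of } x,$$

and we also write

$$\frac{\sum x_i y_i}{n} - \left(\frac{\sum x_i}{n}\right)\left(\frac{\sum y_i}{n}\right) = s_{xy}, \text{ which is called the covariance of } x, y.$$

($s_{xy}$ is a simplified statistical version of the covariance of x, y which arose in
12.5.)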


Hence $b = s_{xy}/s_x^2$. This quantity, $s_{xy}/s_x^2$, known as the coefficient of regression of y on x, is
therefore the gradient of the line of regression.

The equation of the line of regression can therefore be written in its most
convenient form as:

$$y - \bar{y} = \frac{s_{xy}}{s_x^2}\,(x - \bar{x}).$$

If, alternatively, y is taken as the independent variable, it is clear by
symmetry that the line of regression of x on y is given by

$$x - \bar{x} = \frac{s_{xy}}{s_y^2}\,(y - \bar{y}).$$
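
As a check on the derivation, the following Python sketch computes a and b from the variance and covariance and then confirms that the two equations (1) and (2) are satisfied. The function name `regression_line` and the five data points are hypothetical, chosen only for illustration.

```python
def regression_line(xs, ys):
    """Least squares line y = a + b*x, with b = s_xy / s_x^2 and
    a = y_bar - b * x_bar (the line passes through the mean point)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xx = sum(x * x for x in xs) / n - x_bar ** 2                 # variance of x
    s_xy = sum(x * y for x, y in zip(xs, ys)) / n - x_bar * y_bar  # covariance
    b = s_xy / s_xx
    a = y_bar - b * x_bar
    return a, b


# Hypothetical data, used only to illustrate the check.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
a, b = regression_line(xs, ys)

# Equations (1) and (2): both sums should vanish (up to rounding error).
residuals = [y - a - b * x for x, y in zip(xs, ys)]
eq1 = sum(residuals)
eq2 = sum(x * r for x, r in zip(xs, residuals))
print(a, b, eq1, eq2)
```

Equation (1) says that the deviations in the y-direction sum to zero; equation (2) says that they are also uncorrelated with x.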

Now let us return to our original bivariate data, where the two variates 
were mathematics (x) and French (y) marks. We will see how the French 
mark depends linearly on the mathematics mark by obtaining the line of
regression of y on x.

   x     y      x²      xy
  10    10     100     100
  13    16     169     208
  17    11     289     187
  19    16     361     304
  21    17     441     357
  22    17     484     374
  22    18     484     396
  22    20     484     440
  23    22     529     506
  24    18     576     432
  24    25     576     600
  26    20     676     520
  27    21     729     567
  28    18     784     504
  31    18     961     558
  31    24     961     744
  31    26     961     806
  34    29   1 156     986
  35    22   1 225     770
  37    25   1 369     925


$\sum x_i = 497$, $\sum y_i = 393$, $\sum x_i^2 = 13\,315$, $\sum x_i y_i = 10\,284$


$$\Rightarrow\; \frac{\sum x_i^2}{20} = 665.75, \qquad \frac{\sum x_i y_i}{20} = 514.20,$$

$$\bar{x} = 24.85, \qquad \bar{y} = 19.65.$$

Hence

$$s_x^2 = 665.75 - (24.85)^2 = 665.75 - 617.52 = 48.23$$
$$s_{xy} = 514.20 - 24.85 \times 19.65 = 514.20 - 488.30 = 25.90$$
$$\Rightarrow\; \frac{s_{xy}}{s_x^2} = \frac{25.90}{48.23} = 0.537.$$


Therefore the line of regression of y on x is
y − 19.65 = 0.537(x − 24.85)
= 0.537x − 13.35
=> y = 0.54x + 6.30, to 2 decimal places.


In a similar way, we can calculate the regression line of x on y to be
y = 0.84x − 1.30.
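
The arithmetic of this worked example can be reproduced with a short Python sketch. It is offered only as a check; the variable names are our own. Small differences in the last decimal place from the figures above arise because the text rounds at intermediate steps.

```python
maths = [10, 13, 17, 19, 21, 22, 22, 22, 23, 24,
         24, 26, 27, 28, 31, 31, 31, 34, 35, 37]
french = [10, 16, 11, 16, 17, 17, 18, 20, 22, 18,
          25, 20, 21, 18, 18, 24, 26, 29, 22, 25]

n = len(maths)
x_bar = sum(maths) / n
y_bar = sum(french) / n
s_xx = sum(x * x for x in maths) / n - x_bar ** 2
s_yy = sum(y * y for y in french) / n - y_bar ** 2
s_xy = sum(x * y for x, y in zip(maths, french)) / n - x_bar * y_bar

# Regression of y on x:  y = a_yx + b_yx * x
b_yx = s_xy / s_xx
a_yx = y_bar - b_yx * x_bar

# Regression of x on y:  x = c_xy + b_xy * y, rewritten as y = (x - c_xy) / b_xy
b_xy = s_xy / s_yy
c_xy = x_bar - b_xy * y_bar

print(round(b_yx, 3), round(a_yx, 2))             # gradient and intercept, y on x
print(round(1 / b_xy, 2), round(-c_xy / b_xy, 2)) # same for x on y, in the form y = ...
```

Both lines pass through the mean point (24.85, 19.65), which is why they intersect there.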


The scatter diagram for the data is reproduced below with the two
regression lines superimposed. The intersection of the two lines is the point
$(\bar{x}, \bar{y})$, since of course we have shown that this point must lie on both lines.


Example 1 (Grouped data)


In a sample of 150 married couples, the relationship between age of 
husband and wife was investigated. The results are illustrated in the table 
below, where the numbers in the table are frequencies (e.g., 5 couples 
with both husband and wife in the age-range 16-19). 


Age of Age of husband (X) 
wife (Y) 16-19 20-29 30-39 40-49 50-59 60-69 70-79 


Find, as conveniently as possible, the two lines of regression for the data. 


We first of all label each interval by its average value, so that for instance 
the interval 20-29 (which includes ages actually up to 30), is labelled as 25. 
We then choose 45 as a working zero for both X and Y and scale down the 
values of X, Y by 10 to obtain the transformed (or coded) variables x, y. 
Working in terms of x, y will simplify the necessary arithmetic. The table 
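then takes the form shown opposite, and we add the columns to obtain the
frequencies $f_x$, and the rows to obtain $f_y$. The columns and rows signifying
$x f_x$, $x^2 f_x$, $y f_y$ and $y^2 f_y$ can then be easily completed and their totals
calculated.


[Table: the coded frequencies, with the marginal totals $f_x$, $f_y$ and the rows and columns $x f_x$, $x^2 f_x$, $y f_y$, $y^2 f_y$.]


So $\sum f_x = \sum f_y = \sum f = 150$,

and
$$\sum x f_x = -45.2 \;\Rightarrow\; \bar{x} = \frac{\sum x f_x}{\sum f_x} = -0.301$$
$$\sum x^2 f_x = 330.7 \;\Rightarrow\; \frac{\sum x^2 f_x}{\sum f_x} = 2.205$$
$$\sum y f_y = -63.6 \;\Rightarrow\; \bar{y} = \frac{\sum y f_y}{\sum f_y} = -0.424$$
$$\sum y^2 f_y = 356.3 \;\Rightarrow\; \frac{\sum y^2 f_y}{\sum f_y} = 2.375.$$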


It remains to calculate $\sum xyf$. This is done simply by drawing out the
table again, but with the entries this time representing xyf rather than just
the frequency f. Then the numbers in each row are added to obtain a
further column, and the total for this column gives $\sum xyf$.


[Table: the coded table redrawn with entries xyf and a further column of row totals.]


So $\sum xyf / \sum f = 2.133$.


Hence
$$s_x^2 = 2.205 - (-0.301)^2 = 2.114$$
$$s_y^2 = 2.375 - (-0.424)^2 = 2.195$$
$$s_{xy} = 2.133 - (-0.301) \times (-0.424) = 2.005.$$


Now transforming back to the original variables X and Y,

$$\bar{X} = 10\bar{x} + 45 = 41.99$$
$$\bar{Y} = 10\bar{y} + 45 = 40.76$$
$$s_X^2 = 100 s_x^2 = 211.4$$
$$s_Y^2 = 100 s_y^2 = 219.5$$
$$s_{XY} = 100 s_{xy} = 200.5.$$

So the regression coefficient of Y on X = 200.5/211.4 = 0.948
and the regression coefficient of X on Y = 200.5/219.5 = 0.913.


Therefore the equation of the regression line of Y on X is


Y − 40.76 = 0.948 (X − 41.99)
=> Y = 0.9X + 0.9, to 1 decimal place,


and the equation of the regression line of X on Y is


X − 41.99 = 0.913 (Y − 40.76)
=> Y = 1.1X − 5.2, to 1 decimal place.
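
It may help to see why the coding does not disturb the regression coefficients. The short derivation below is a sketch under the coding used in this example, X = 10x + 45 and Y = 10y + 45; it uses only the definitions of mean, variance and covariance given earlier.

```latex
\begin{aligned}
\bar{X} &= 10\bar{x} + 45, \qquad \bar{Y} = 10\bar{y} + 45,\\
s_X^2 &= \frac{1}{n}\sum (X_i - \bar{X})^2 = \frac{1}{n}\sum 100\,(x_i - \bar{x})^2 = 100\,s_x^2,\\
s_{XY} &= \frac{1}{n}\sum (X_i - \bar{X})(Y_i - \bar{Y}) = 100\,s_{xy},\\
\frac{s_{XY}}{s_X^2} &= \frac{100\,s_{xy}}{100\,s_x^2} = \frac{s_{xy}}{s_x^2} = \frac{2.005}{2.114} = 0.948 .
\end{aligned}
```

Because both variables are scaled by the same factor, the gradient is unchanged; the working zero affects only the mean point through which the line is drawn.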


Exercise 12.7a 


1 (i) Obtain the heights (x) and masses (y) of 10 people of about the
same age. Represent this data on a scatter diagram, and draw by eye
through the points of your diagram what you consider to be the line of
'best fit'. Calculate the equations of the lines of regression of y on x and of
x on y, and draw them on the same diagram.

(ii) Obtain the heights and masses of a larger number of people, and group
this data in a suitable way. Then calculate the equations of the two
regression lines, and compare with the results of part (i).


2 An experiment was conducted to discover the effect of adding a certain
compound to the diet of mice. The results of the experiment are:

No. of units of
compound added (x)      1     2     3     4     5     6     7     8


Mean gain in mass of
mice (y)/mg           8.7  10.9   9.2  10.9  11.6  11.1  12.3  13.8


Find the equation of the line of regression of y on x.


3 In the following table W g is the mass of a certain chemical substance
which dissolved in water at T °C:


Calculate the equation of the line of regression of W on T. Use this equation
to obtain a tentative value for W when T = 56. (M.E.I.)


4 A sample of eight families of a certain monkey species was chosen in 
which each family had just a single fully grown male offspring, and the 
heights of father and son for each family are given below. 


Height of father/cm (x)   67  64  70  71  65  67  68  69


Height of son/cm (y)      68  62  70  73  64  69  71  69


Plot the data on a scatter diagram. Find the equation of the line of regres- 
sion of y on x, and superimpose this line on the diagram. 


5 The breadth (x) and length (y) of 12 leaves from a certain tree are given 
below: 


Breadth/mm 39 36 35 955 30 51] 34 28 2425. 50 19 29 


Length/mm 81 80 74 89 71 73 76 75 84 81 77 82 


Plot the data on a scatter diagram. Find the equations of the regression
lines of (i) y on x, and (ii) x on y, and draw these lines on the scatter
diagram.


6 A certain machine processes raw material. The purities (x) of 10 batches
of raw material are given below along with the yield (y) from each batch.


Purity (x)   0.49  0.41  0.53  0.46  0.47  0.39  0.51  0.42  0.49  0.52


Yield (y) | 24 21 26 25 20 ae δ DG [55 97 


Plot the data on a scatter diagram. Find the equation of the line of regression
of y on x, and superimpose this line on the diagram.


7 The chronological ages and the reading ages (in years and months) of
10 children from a primary school are given below:

Chronological age   7.5  7.11  8.1  8.2  8.8  9.0  9.4  9.10  10.2  10.6


Reading age         7.8  7.10  8.0  7.9  8.9  9.1  8.8  9.10  12.1  10.7


Find the equation of the line of regression of reading age (y) on chrono- 
logical age (x), if x and y are measured in months. 


8 To test the effect of a new drug, twelve patients were examined before
the drug was administered and given an initial score (I) depending on the
severity of various symptoms. After taking the drug they were examined
again and given a final score (F). A decrease of score represented an
improvement. The scores for the twelve patients are given in the table.


Patient       1   2   3   4   5   6   7   8   9  10  11  12


Initial (I)  61  23   8  14  42  34  32  31  41  25  20  50
Final (F)    49  12   3   4  28  27  20  20  34  15  16  40


Calculate the equation of the line of regression of F on I.
On the average, what improvement would you expect for a patient
whose initial score was 30? (M.E.I.)


9 The following table gives the results of 4 estimations of a variable y at 
each of four values of a variable x. Calculate the equation of the regression 
line of y on x. 


Estimate the mean value of y when x = 3. (M.E.I.)


10 The marks of 100 pupils in English language and English literature
examinations were graded from 1 to 9 and their results are given in the
frequency table below:


[Frequency table: English literature grade (y, rows 1 to 9) against English language grade (x, columns 1 to 9).]


Find the equations of the lines of regression of (i) y on x, and (ii) x on y.


11 The following table gives, in coded form, the breadths and lengths of 
1 306 human heads. 


Frequencies of heads 
X (length) 


Y 
(breadth) 


4 18 70 177 291 360 231 109 


The length X is coded with 18.85 cm as origin and the breadth Y is coded 
with 15.15 cm as origin. The interval of grouping is 0.4 cm in both cases. 
Calculate the average length and the average breadth of the heads, and 
the equation of the line of regression of breadth on length. (M.E.I.) 


12 The assets of a certain company have grown over the period 1850 to
1966 as shown in the table:


Year X | 1850 1870 1900 1930 1960 1964 1965 1966 


Assets/£10° Y 5 56 161 464 1498 2250 2527 2905 


Calculate the line of regression of $\log_{10} Y$ on X. (M.E.I.)


13 Six pairs of values of x and y are given in the following table: 


y| 15 31 49 7.0 94 12.2 


It is thought that these values are connected by a relation of the form
$y = ax + bx^3$. By writing Y = y/x, and $X = x^2$, reduce the relation to a
linear one and calculate:

(i) the means of X and Y;

(ii) the variance of X;

(iii) the covariance of X and Y;

(iv) estimated values of a and b on the assumption that there is a linear
relationship between X and Y. (O.C.)


The correlation coefficient


Let us consider further the general line of regression of y on x:

$$y - \bar{y} = \frac{s_{xy}}{s_x^2}\,(x - \bar{x}).$$

If we divide through by $s_y$, the standard deviation of the y values, the
equation can be written

$$\frac{y - \bar{y}}{s_y} = \frac{s_{xy}}{s_x s_y} \cdot \frac{x - \bar{x}}{s_x}.$$

Now in this equation $(y - \bar{y})/s_y$ and $(x - \bar{x})/s_x$ are standardised versions
of the variates y and x, and the quantity $s_{xy}/s_x s_y$, which is symmetrical in x
and y, is known as the coefficient of correlation of x and y; or, more exactly, the
product-moment correlation coefficient.

The correlation coefficient is normally denoted by the letter r, and if the
standardised variates are written as x′ and y′, the regression lines of y on x
and x on y become:


$y' = r x'$ and $x' = r y'$ (so that $y' = x'/r$) respectively.
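The limits on the value of r can now be determined, using a result known
as Cauchy's inequality:
If $a_i$, $b_i$ are real numbers (for i = 1, 2, ..., n)

then
$$\left(\sum a_i b_i\right)^2 \le \left(\sum a_i^2\right)\left(\sum b_i^2\right).$$

[For consider the expression $\sum (\lambda a_i + b_i)^2$, where λ is any real number.
Since each term of the summation is a square, and hence non-negative, it follows
that

$$\sum (\lambda a_i + b_i)^2 \ge 0 \text{ for all } \lambda$$
$$\Rightarrow\; \sum (\lambda^2 a_i^2 + 2\lambda a_i b_i + b_i^2) \ge 0$$
$$\Rightarrow\; \lambda^2 \sum a_i^2 + 2\lambda \sum a_i b_i + \sum b_i^2 \ge 0 \text{ for all } \lambda.$$

Now we know that in general if $ax^2 + 2bx + c \ge 0$ for all x, then $b^2 \le ac$.
So if we regard $\lambda^2 \sum a_i^2 + 2\lambda \sum a_i b_i + \sum b_i^2$ as a quadratic function in
λ, it follows that

$$\left(\sum a_i b_i\right)^2 \le \left(\sum a_i^2\right)\left(\sum b_i^2\right).]$$


Now for the correlation coefficient, r,

$$r = \frac{s_{xy}}{s_x s_y} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum (x_i - \bar{x})^2 \, \sum (y_i - \bar{y})^2}}.$$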

So, applying Cauchy's inequality with $(x_i - \bar{x}) = a_i$ and $(y_i - \bar{y}) = b_i$, we
see that $r^2 \le 1$.

Let us now look at three particular values of r:


(i) r = 1:    In this case, the regression lines of y on x and x on y coincide,
              and if standardised they have gradient 1; this means that
              all the points of the scatter diagram are collinear, and we
              say that there is exact positive (or direct) correlation between
              the variates.


(ii) r = 0:   The regression line of y on x is horizontal, and that of x on y
              is vertical. This indicates a complete lack of linear
              association between the variates, which are said to be uncorrelated.


(iii) r = −1: The regression lines coincide, and if standardised have
              gradient −1; small values of x correspond to large values
              of y and vice-versa, all points being collinear. In this case
              we say that there is exact negative (or inverse) correlation
              between the variates.
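
The three special cases can be illustrated numerically. In the Python sketch below the function `correlation` is our own direct transcription of $r = s_{xy}/s_x s_y$, and the three small data sets are hypothetical, chosen so that the points are collinear with positive gradient, collinear with negative gradient, and symmetric about the mean of x.

```python
from math import sqrt


def correlation(xs, ys):
    """Product-moment correlation coefficient r = s_xy / (s_x * s_y)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
    s_x = sqrt(sum((x - x_bar) ** 2 for x in xs) / n)
    s_y = sqrt(sum((y - y_bar) ** 2 for y in ys) / n)
    return s_xy / (s_x * s_y)


xs = [1, 2, 3, 4, 5]
r_direct = correlation(xs, [2 * x + 1 for x in xs])    # collinear, gradient > 0
r_inverse = correlation(xs, [10 - 3 * x for x in xs])  # collinear, gradient < 0
r_none = correlation(xs, [4, 1, 0, 1, 4])              # y = (x - 3)^2
print(r_direct, r_inverse, r_none)
```

The third data set shows why r = 0 is described as a lack of linear association: y is completely determined by x there, yet the variates are uncorrelated.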


Significance of the correlation coefficient 


If, in a particular example, we obtain a value of r close to 1 or a value which
is nearly zero, then little further analysis is necessary; but how do we
interpret a correlation coefficient of (say) 0.5? We wish to know its significance,
that is, the probability of obtaining such a high value if the variates are
uncorrelated (i.e., with the null hypothesis that r = 0 for the background
population). This will depend on the size of the sample from which r is
calculated: the more data we use, the more confident we can feel about the
value of r. The mathematics involved in the relevant significance test is too
involved for this text, but we can (without at present being concerned about
the mathematical justification) use a graph like that below. This shows
the size of samples required to give 5%, 1%, and 0.1% significance levels
for values of the correlation coefficient r (using a one-tailed significance
test).

So, using the graph, we can say that a correlation coefficient of 0.5
obtained from a bivariate sample of 15 pairs is significantly different from
zero at the 5% level, but if obtained from a sample of only 10 pairs is not
significant. Similarly a value of r = 0.8, obtained from a sample of size 9,
is significant at the 1% level, but not at the 0.1% level.


[Graph: correlation coefficient r against sample size (no. of pairs of bivariate data), showing the 5%, 1% and 0.1% significance levels.]


Warning 


A significant correlation coefficient does not necessarily imply causal con- 
nection between two quantities. There is, for instance, a correlation 
coefficient of 0.75, which is significant at the 5% level, between the number 
of indictable crimes committed and life expectation in Great Britain 
between 1900 and 1960 (measured at 10-yearly intervals). But we would 
hardly accept that an extended life-span increased a person’s criminal 
tendencies; nor even vice-versa! 


Example 2 


Calculate the product-moment correlation coefficient for the mathematics 
and French marks given at the beginning of the section, and investigate the 
significance of this coefficient. 


We have already calculated $s_{xy} = 25.90$
and $s_x^2 = 48.23 \Rightarrow s_x = 6.94$.
Now for the data, $\sum y_i^2 = 8\,159$

$$\Rightarrow\; \frac{\sum y_i^2}{n} = 407.95$$
$$\Rightarrow\; s_y^2 = 407.95 - (19.65)^2 = 407.95 - 386.12 = 21.83$$
$$\Rightarrow\; s_y = 4.67.$$

Therefore
$$r = \frac{s_{xy}}{s_x s_y} = \frac{25.90}{6.94 \times 4.67} = 0.80.$$


Now this value of r was calculated from 20 pairs of data, so using the
significance graph we can see that this correlation coefficient is definitely
significant at the 0.1% level. This is therefore extremely strong evidence of
correlation between the mathematics and French marks in the population
from which the sample was drawn.
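
The value of r in Example 2 can be confirmed with a few lines of Python. This is a checking sketch only, with variable names of our own choosing; it also verifies the identity, immediate from the definitions, that $r^2$ is the product of the two regression coefficients $s_{xy}/s_x^2$ and $s_{xy}/s_y^2$.

```python
from math import sqrt

maths = [10, 13, 17, 19, 21, 22, 22, 22, 23, 24,
         24, 26, 27, 28, 31, 31, 31, 34, 35, 37]
french = [10, 16, 11, 16, 17, 17, 18, 20, 22, 18,
          25, 20, 21, 18, 18, 24, 26, 29, 22, 25]

n = len(maths)
x_bar = sum(maths) / n
y_bar = sum(french) / n
s_xx = sum(x * x for x in maths) / n - x_bar ** 2    # variance of x
s_yy = sum(y * y for y in french) / n - y_bar ** 2   # variance of y
s_xy = sum(x * y for x, y in zip(maths, french)) / n - x_bar * y_bar

r = s_xy / sqrt(s_xx * s_yy)
print(round(r, 2))

# r^2 equals the product of the regression coefficients of y on x and x on y.
print(abs(r ** 2 - (s_xy / s_xx) * (s_xy / s_yy)) < 1e-12)
```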


Example 3 


The heights and masses of a random sample of 10 first-formers from Watford 
Grammar School are given below. 


Height/cm   156  151  152  160  146  157  149  142  158  141
Mass/kg      47   38   44   55   46   39   45   30   45   32


On the basis of this sample, how significant is the correlation between 
height and mass? 


Calculation will be considerably simplified if we subtract 150 cm from 
each height and 40 kg from each mass. This is equivalent to translating the 
axes of the scatter diagram, so does not affect the value of the correlation 
coefficient. The data therefore becomes: 


   x     y     x²     y²     xy
   6     7     36     49     42
   1    −2      1      4     −2
   2     4      4     16      8
  10    15    100    225    150
  −4     6     16     36    −24
   7    −1     49      1     −7
  −1     5      1     25     −5
  −8   −10     64    100     80
   8     5     64     25     40
  −9    −8     81     64     72

  12    21    416    545    354   (totals)


Therefore $\bar{x} = 1.2$ and $\bar{y} = 2.1$.
So, working correct to 3 significant figures,

$$s_x^2 = 41.6 - 1.2^2 = 40.2, \qquad s_y^2 = 54.5 - 2.1^2 = 50.1, \qquad s_{xy} = 35.4 - 1.2 \times 2.1 = 32.9,$$
$$\Rightarrow\; s_x = 6.34, \qquad s_y = 7.08.$$

Hence
$$r = \frac{32.9}{6.34 \times 7.08} = 0.73,$$

and using the significance graph, we find that this correlation coefficient is
significant at the 1% level.
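
The claim that translating the axes leaves r unchanged can be checked directly. In this Python sketch the function `correlation` is our own helper, and the masses are those tabulated for the same boys in Example 4.

```python
from math import sqrt


def correlation(xs, ys):
    """Product-moment correlation coefficient r = s_xy / (s_x * s_y)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
    s_x = sqrt(sum((x - x_bar) ** 2 for x in xs) / n)
    s_y = sqrt(sum((y - y_bar) ** 2 for y in ys) / n)
    return s_xy / (s_x * s_y)


heights = [156, 151, 152, 160, 146, 157, 149, 142, 158, 141]   # cm
masses = [47, 38, 44, 55, 46, 39, 45, 30, 45, 32]              # kg

# Coded values: subtract 150 cm from each height and 40 kg from each mass.
coded_x = [h - 150 for h in heights]
coded_y = [m - 40 for m in masses]

r_raw = correlation(heights, masses)
r_coded = correlation(coded_x, coded_y)
print(round(r_raw, 2), round(r_coded, 2))
```

Both calls give the same coefficient, since only the deviations from the means enter the formula.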


Exercise 12.7b 


1-6 Calculate the correlation coefficients for the data in exercise 12.7a,
nos. 1-6, and comment on the significance of the coefficient in each case.


7 Calculate the coefficient of correlation between the mass of the heart 
and mass of the liver in mice, using the following data. 


Mass of organ 
Heart/10°* g 20 16 20 2] 26 24 18 18 


Liver/10°-*g¢ | 230 126 203 241 159 230 140 242 


What do you conclude from this analysis? (M.E.I.) 


8 The following table gives the number of goals scored at home and away
by ten teams in the First Division of the Football League, part way through
the 1972-3 season:


Away lO 14 112 195 6 610 8 9 "6 7 


Calculate the product-moment correlation coefficient for home and away 
goals. 


9 The intelligence quotients of 8 students were assessed, and the time (in 
minutes) it took them to complete a certain piece of work was measured. 
Their I.Qs. and times are given below: 


Is the negative correlation between these significant? 


10 Calculate the correlation coefficient for the data of exercise 12.7a,
no. 10.


11 Calculate the correlation coefficient for the data of exercise 12.7a,
no. 11.


12 Calculate the product-moment correlation coefficient for the bivariate
data given in the following frequency table:


13 Ten sets of readings for the variates x, y, and z are given below. 


x|2 3 5 7 8 IW 13 16 18 19 
y|/8 5 12 MW 2 9 24 17 12 25 
z|8 10 10 7 12 414 115 21 28 26 


Calculate the coefficients of correlation between (i) x and y, (ii) y and z,
and (iii) z and x, and investigate the significance of each.


Rank correlation: Spearman’s rank correlation coefficient 


Six athletes competed in both a 1 500 m race and a 400 m race, and finished 
in the following positions: 


Can anything be said about correlation of performance at these two 
distances? 


If the times for the athletes were given, then the product-moment correla- 
tion coefficient for these times could be calculated, as in previous examples. 
Here, however, the only information is about position or rank. Therefore
the product-moment correlation coefficient of the ranks is calculated, and
is referred to as the coefficient of rank correlation.

Since ranks (or positions) are less informative than actual measurements,
the rank coefficient is correspondingly less informative than the
product-moment coefficient. However, since only integers are involved, the rank
coefficient does have the advantage of being easily calculated. It is evaluated
by means of the formula:

$$\rho = 1 - \frac{6 \sum d^2}{n(n^2 - 1)},$$

where d is the difference between the two ranks of each item and n is the
number of items.

The derivation of this formula follows the worked examples.

This rank correlation coefficient was first defined by Spearman in 1906,
and until recently has been the most widely used. But there are others, so it
is usually called Spearman's rank correlation coefficient and its significance
graph is given below:


[Graph: rank correlation coefficient ρ against sample size, showing the significance levels.]


Returning to the data for the two races, ρ is evaluated as follows:


Athlete          A    B    C    D    E    F

difference d    −1   −1   −2    3    1    0

d²               1    1    4    9    1    0      Σd² = 16.

Hence
$$\rho = 1 - \frac{6 \times 16}{6 \times 35} = 1 - \frac{16}{35} = \frac{19}{35} = 0.54,$$

which, using the significance graph, is not significantly different from zero
at the 5% level.
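
The evaluation above amounts to a single formula applied to the list of rank differences. A minimal Python sketch follows; the function name `spearman_from_differences` is our own, and exact fractions are used so that the value 19/35 appears before rounding.

```python
from fractions import Fraction


def spearman_from_differences(differences):
    """Spearman's coefficient 1 - 6*sum(d^2) / (n*(n^2 - 1)), computed
    from the rank differences d (valid when no ranks are tied)."""
    n = len(differences)
    sum_d2 = sum(d * d for d in differences)
    return 1 - Fraction(6 * sum_d2, n * (n * n - 1))


d = [-1, -1, -2, 3, 1, 0]          # rank differences for athletes A to F
rho = spearman_from_differences(d)
print(rho, round(float(rho), 2))
```

Note that the differences must sum to zero, since both sets of ranks are the integers 1 to n; this is a useful check before squaring.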


Example 4 


What is the rank correlation coefficient for the heights and masses of 
schoolboys given in example 3, and how significant is this coefficient? 
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The first step is to rank (in ascending order) the heights and masses. But 
the problem arises of how to deal with the two equal masses of 45 kg;
between them, they must occupy the 6th and 7th positions, so by
convention each is given the rank 6.5. (It is not difficult to envisage how this is
extended to more than two equal ranks, each being given the average of
the ranks they replace.) The heights and masses are now ranked in the
following table:


Height/cm   Mass/kg   Height rank   Mass rank     d      d²
   156         47          7            9         2      4
   151         38          5            3         2      4
   152         44          6            5         1      1
   160         55         10           10         0      0
   146         46          3            8         5     25
   157         39          8            4         4     16
   149         45          4            6.5       2.5    6.25
   142         30          2            1         1      1
   158         45          9            6.5       2.5    6.25
   141         32          1            2         1      1

                                                        64.5

So n = 10 and Σd² = 64.5

$$\rho = 1 - \frac{6 \times 64.5}{10 \times 99} = 1 - 0.39 = 0.61;$$


and using the significance graph, we find that this coefficient is significant 
at the 5% level. (As compared with the product-moment coefficient which 
was significant at the 1% level.) 
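
The ranking convention and the formula used in Example 4 can be sketched in Python. This is only an illustration under our own naming: `average_ranks` and `spearman_rho` are hypothetical helper names, not part of the text, and tied values are given the mean of the ranks they occupy, as described above.

```python
def average_ranks(values):
    """Rank values in ascending order; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + 1 + end + 1) / 2  # ranks are counted from 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's coefficient from the formula 1 - 6*sum(d^2) / (n(n^2 - 1))."""
    n = len(x)
    rank_x, rank_y = average_ranks(x), average_ranks(y)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))


# Heights (cm) and masses (kg) of the ten schoolboys in Example 4.
heights = [156, 151, 152, 160, 146, 157, 149, 142, 158, 141]
masses = [47, 38, 44, 55, 46, 39, 45, 30, 45, 32]

print(average_ranks(masses))
print(round(spearman_rho(heights, masses), 2))
```

With the two equal masses sharing rank 6.5, the sum of squared differences is 64.5, in agreement with the table.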


Derivation of the formula for Spearman’s rank correlation coefficient


The product-moment correlation coefficient is required where the two 
variates, x and y, are ranks, each taking values 1, 2,..., n in some order 
or other. 

We shall assume the standard results (see chapter 6) 


$$\sum_{i=1}^{n} i = \tfrac{1}{2} n(n + 1)$$
$$\sum_{i=1}^{n} i^2 = \tfrac{1}{6} n(n + 1)(2n + 1).$$


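Since we need to calculate $\rho = s_{xy}/(s_x s_y)$, it is necessary first of all to evaluate
$s_x$, $s_y$ and $s_{xy}$.

$$\sum x_i^2 = \sum_{i=1}^{n} i^2 = \tfrac{1}{6} n(n + 1)(2n + 1)$$

$$\Rightarrow\quad s_x^2 = \frac{1}{n}\sum x_i^2 - \bar{x}^2
 = \tfrac{1}{6}(n + 1)(2n + 1) - \tfrac{1}{4}(n + 1)^2$$

$$= \tfrac{1}{12}(n + 1)\left[2(2n + 1) - 3(n + 1)\right]
 = \tfrac{1}{12}(n + 1)(n - 1) = \tfrac{1}{12}(n^2 - 1)$$

$$\Rightarrow\quad s_x = \sqrt{\tfrac{1}{12}(n^2 - 1)}, \text{ and similarly } s_y = \sqrt{\tfrac{1}{12}(n^2 - 1)}$$

$$\Rightarrow\quad s_x s_y = \tfrac{1}{12}(n^2 - 1)$$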


We now use the identity: 


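$$x_i y_i = \tfrac{1}{2}\left[x_i^2 + y_i^2 - (x_i - y_i)^2\right]$$

$$\Rightarrow\quad \sum x_i y_i = \tfrac{1}{2}\sum x_i^2 + \tfrac{1}{2}\sum y_i^2 - \tfrac{1}{2}\sum (x_i - y_i)^2
 = \tfrac{1}{6} n(n + 1)(2n + 1) - \tfrac{1}{2}\sum d^2$$

$$\Rightarrow\quad \frac{1}{n}\sum x_i y_i = \tfrac{1}{6}(n + 1)(2n + 1) - \frac{\sum d^2}{2n}.$$

Now

$$s_{xy} = \frac{1}{n}\sum x_i y_i - \bar{x}\bar{y}
 = \tfrac{1}{6}(n + 1)(2n + 1) - \frac{\sum d^2}{2n} - \tfrac{1}{4}(n + 1)^2
 = \tfrac{1}{12}(n^2 - 1) - \frac{\sum d^2}{2n}.$$

Therefore

$$\rho = \frac{s_{xy}}{s_x s_y}
 = \frac{\tfrac{1}{12}(n^2 - 1) - \dfrac{\sum d^2}{2n}}{\tfrac{1}{12}(n^2 - 1)}
 = 1 - \frac{6\sum d^2}{n(n^2 - 1)}.$$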


It should be noted that we have assumed in these calculations that the
ranks are all different. If, however, equal ranks occur, $\sum x_i^2$ and/or $\sum y_i^2$
are no longer equal to $\tfrac{1}{6} n(n + 1)(2n + 1)$, and the above formula is no
longer strictly valid; but the error involved is generally very small.
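
The derivation can be checked numerically. The sketch below compares the product-moment coefficient of two sets of distinct ranks with the value given by the formula. The function names and the particular rankings are our own illustrative choices (the second ranking was picked so that the differences are −1, −1, −2, 3, 1, 0, as in the race example); they are not data quoted from the text.

```python
from math import sqrt


def product_moment(x, y):
    """Product-moment correlation coefficient s_xy / (s_x * s_y)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    s_xy = sum(a * b for a, b in zip(x, y)) / n - mean_x * mean_y
    s_x = sqrt(sum(a * a for a in x) / n - mean_x ** 2)
    s_y = sqrt(sum(b * b for b in y) / n - mean_y ** 2)
    return s_xy / (s_x * s_y)


def spearman_formula(x, y):
    """1 - 6*sum(d^2) / (n(n^2 - 1)), valid when all ranks are different."""
    n = len(x)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))


# Hypothetical rankings of six individuals (no ties).
ranks_x = [1, 2, 3, 4, 5, 6]
ranks_y = [2, 3, 5, 1, 4, 6]

print(product_moment(ranks_x, ranks_y))
print(spearman_formula(ranks_x, ranks_y))
```

Both calls give the same value, 19/35, up to rounding error, as the algebra predicts when no ranks are tied.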


Kendall’s rank correlation coefficient 


Another coefficient of rank correlation, known as Kendall’s rank correlation
coefficient (τ), is derived quite differently, but has the same range of
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values (—1 to 1) as the other correlation coefficients, and values of τ 
equal to −1, 0, and 1 have the same meaning as for r and ρ. It is a measure
based on a simple principle of agreements and disagreements between the 
orders in which the objects are placed. 

If we have a total of n objects (A, B, ...) ranked in two different ways,
then we can choose $\binom{n}{2} = \tfrac{1}{2} n(n - 1)$ different pairs, such as A and B.
Now for the two rankings, the relative positions (A before B or B before A)
can agree or disagree. So the maximum number of agreements, and the
maximum number of disagreements of these pairs are both equal to
$\tfrac{1}{2} n(n - 1)$. Kendall’s coefficient is calculated by subtracting the total
number of such disagreements from the total number of agreements, and
then dividing by the maximum possible number of agreements, i.e.,

$$\tau = \frac{\text{no. of agreements} - \text{no. of disagreements}}{\tfrac{1}{2} n(n - 1)}$$


Clearly if both rankings are exactly the same, all pairs will agree in
relative position, so that the number of agreements will be $\tfrac{1}{2} n(n - 1)$ and
the number of disagreements will be zero; hence τ will equal 1. Similarly
for total disagreement τ will equal −1.


The significance graph for τ is given below: 


[Significance graph: rank correlation coefficient τ against sample size]


At first glance it would seem a very lengthy procedure to calculate τ,
but in practice it can be calculated very quickly and conveniently, by 
either of the two methods in the following example. 
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Example 5 


In a brass-band competition, two judges place the eight bands in the 
following order: 


Band         A   B   C   D   E   F   G   H
1st judge    3   6   7   2   4   8   1   5
2nd judge    5   7   4   1   3   8   2   6


Calculate Kendall’s rank correlation coefficient for the two judgements. 


Method 1 


Re-order the bands so that they are in the correct order of merit according
to the first judge. Take the rank given by the second judge to the first
band, G, that is 2. Now count +1 for every number to the right which is
greater than 2 and −1 for each number which is less than 2; these totals
are given below the rank. Repeat this procedure for the next rank in the
second row, remembering to count only numbers to the right of it. And so
on for each rank in the second row. The positive numbers so obtained
represent agreements, and the negative numbers disagreements, so that
their sum gives S, the total number of agreements less the disagreements.


              G    D    A    E    H    B    C    F        n = 8

1st judge     1    2    3    4    5    6    7    8
2nd judge     2    1    5    3    6    7    4    8

             +6   +6   +3   +4   +2   +1   +1
             −1    0   −2    0   −1   −1    0
              5    6    1    4    1    0    1             S = 18

$$\tau = \frac{S}{\tfrac{1}{2} n(n - 1)} = \frac{18}{28} = \frac{9}{14}$$


Method 2 


Set out the bands in the order decided by each judge. Then join the same 
letter in each ranking by a straight line. Let c denote the number of crossings. 


1st judge    G   D   A   E   H   B   C   F

2nd judge    D   G   E   C   A   H   B   F

[Diagram: straight lines joining each letter in the first row to the same letter in the second row]
Then $S = \tfrac{1}{2} n(n - 1) - 2c$

$$\Rightarrow\quad \tau = 1 - \frac{4c}{n(n - 1)}.$$

In this case, c = 5.

$$\tau = 1 - \frac{20}{8 \times 7} = 1 - \frac{5}{14} = \frac{9}{14} \approx 0.64.$$


(The justification for this second method is left as an exercise for the reader.)

So Kendall’s rank correlation coefficient is in this case equal to 0.64; 
which, using the graph, is found to be significant at the 5% level. 
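
Both methods of Example 5 amount to counting pairs of bands, which can be sketched in Python as follows. The function names are hypothetical; the rankings are the two judges’ placings as read from the example. Counting the pairs on which the judges disagree gives the number of crossings c of Method 2.

```python
from itertools import combinations


def kendall_counts(rank_a, rank_b):
    """Return (agreements, disagreements) over all pairs of objects."""
    agreements = disagreements = 0
    for i, j in combinations(range(len(rank_a)), 2):
        product = (rank_a[i] - rank_a[j]) * (rank_b[i] - rank_b[j])
        if product > 0:
            agreements += 1      # the pair is placed in the same relative order
        elif product < 0:
            disagreements += 1   # the pair is placed in opposite orders
    return agreements, disagreements


def kendall_tau(rank_a, rank_b):
    """Kendall's coefficient: (agreements - disagreements) / (n(n - 1)/2)."""
    n = len(rank_a)
    agreements, disagreements = kendall_counts(rank_a, rank_b)
    return (agreements - disagreements) / (n * (n - 1) / 2)


first_judge = [3, 6, 7, 2, 4, 8, 1, 5]
second_judge = [5, 7, 4, 1, 3, 8, 2, 6]

agree, disagree = kendall_counts(first_judge, second_judge)
print(agree - disagree)                                # S of Method 1
print(disagree)                                        # crossings c of Method 2
print(round(kendall_tau(first_judge, second_judge), 2))
```

This brute-force pair count takes time proportional to n², which is quite adequate for samples of the size considered here.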
Example 6 


Calculate τ for the race positions given earlier in this section.

Hence

$$\tau = \frac{7}{\tfrac{1}{2} \times 6 \times 5} = \frac{7}{15} \approx 0.47$$

and this value of τ compares with the value (already calculated) of
Spearman’s coefficient, ρ, equal to 0.54.


Exercise 12.7c


1 Six varieties of chocolate, denoted by A, B, C, D, E, F are ranked for
taste in the following order (best first): F C A D B E. The ranking on price
(most expensive first) is C F B A E D. Calculate a rank correlation
coefficient, and comment on the result.


2 At a village fête the fruit cake competition was judged by the vicar and
the local squire. They placed the entries as follows:
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Calculate a rank correlation coefficient. Can you deduce anything about 


the vicar’s and squire’s tastes in fruit cake? 


3 Twelve chemistry students were each given a theory and a practical
examination. Their positions in these two examinations were as follows:


Calculate (i) Spearman’s and (ii) Kendall’s rank correlation coefficient
for the data, and test the significance of each coefficient.


4 Rank the data of exercise 12.7a, no. 4, and calculate Spearman’s rank 
correlation coefficient. 


5 Rank the data of exercise 12.7a, no. 5, and calculate Spearman’s rank 
correlation coefficient for the lengths and breadths of leaves. 


6 Rank the data of exercise 12.7a, no. 6, and calculate Spearman’s
coefficient. Is the rank correlation coefficient between parity and yield
significant?


7 Rank the data of exercise 12.7a, no. 7, and calculate (i) Spearman’s
and (ii) Kendall’s rank correlation coefficient for the chronological and
reading ages of the children.


8 The batting and bowling averages of seven of the members of a cricket
team are put in order as follows:


Batting order     1   2   3   4   5   6   7
Bowling order     7   4   6   5   3   1   2


(i) Calculate Kendall’s rank correlation coefficient between the two
orders.

(ii) Is this significant at the 1% level?

(iii) If the positions of A and C in the bowling order were reversed, show
that Kendall’s r.c.c. would be significant at the 5% level. (S.M.P.)


9 A, B, C, D, E was the order of merit for the five marrows entered in a
local vegetable show. One admirer put them in the order B, A, C, D, E;
show that he got 0.8 as a Kendall coefficient of rank correlation on
comparing his order with the official result. List all other possible orders which
would give the same coefficient when compared with A B C D E. How
many different orders could there be for the five marrows? How likely is
he to obtain a coefficient as high, or higher than he did, purely by chance?

(S.M.P.)


10 At the end of a particular season, the goals scored by the ten teams
forming a football league, as compared with their league positions were as
follows:


Position          1    2    3    4    5    6    7    8    9   10


Goals scored     34   19   31   24   26   20   22   18   19   18


By ranking the number of goals scored (in descending order), calculate a 
rank correlation coefficient between league position and goal score. Is this 
coefficient significant? 


11 Eleven boys from the same class took part in a sponsored walk for a 
charity. The distance each walked, and the amount per kilometre that he 
earned are given below. Calculate the Kendall rank correlation coefficient 
for these two sets of figures. 


Distance/km    | 25  23  22  21  19  18  12  10   9   5  38


Rate/p km⁻¹    | 14  16  12  13  11   7   9   6  10   4   8


Comment briefly on the result. (S.M.P.) 


Miscellaneous problems 12


1 In a game of Ludo a six has to be thrown with a die before each of a
player’s four counters can be moved. Find an expression for the probability 
that a player will be able to move his fourth counter with his tenth throw. 

Find the probability, for a binomial distribution with parameters 
n, p and q, that r trials are required to obtain k successes. 


2 A certain device, used for measuring Earth tremors in a seismological
station, can only maintain the required degree of sensitivity for a total of ten
tremors, and after this is disposed of. If such tremors occur randomly,
but averaging 4 per day, find the probability density function for the useful
lifetime (in days) of such a device, and use this to show that the mean
lifetime is 2½ days. What is the variance of the lifetime?

Find the p.d.f. if the device can be used for α tremors and tremors
average λ per day; the probability distribution with this p.d.f. is referred
to as the gamma distribution.


3 For a certain type of bacterium, the time x from birth to death is a
random variable with p.d.f. $(1 + x)^{-2}$, (x ≥ 0). A culture of such bacteria
is routinely inspected at regular intervals of time.

One bacterium, inspected at time t after birth, is found to be dead already.
Find the probability that this bacterium has been dead for at least time
kt (0 < k < 1).

Another bacterium of the same type is alive at time t after birth, but is
found to be dead by the end of a further time t. Find the probability that
this bacterium had been dead for at least time kt (0 < k < 1). (M.E.I.)


4 When a patient A arrives at a doctor’s consulting room he will be seen 
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at once, and will occupy the doctor for 30 minutes; he is equally likely to 
arrive at any time between 2.00 and 3.30 p.m. Patient B, who comes inde- 
pendently, is equally likely to arrive at any time between 2.00 and 4.00 p.m. 
(i) What is the probability that B will arrive while the doctor is seeing A?
(ii) What is B’s expected waiting time? (O.S.)


5 In a certain large population of men, heights are distributed Normally
about a mean of 180 cm with standard deviation 5 cm. Random samples
are taken with three men in each sample and their heights are arranged in
increasing order. In 1 000 such samples, approximately how many will
have:

(i) the middle height under 175 cm?

(ii) the least height less than 175 cm?

(iii) the least height between 175 and 180 cm? (S.M.P.)


6 The number of eggs laid by a farmyard hen in a week is a Poisson
variable with mean λ. The chance that an egg is fertile is p. Write down the
probability that in one week n eggs are laid, and r of these are fertile.

By summing this probability over appropriate values of n find the
probability that r fertile eggs are collected in one week.

Show that if this number r is given, then the distribution of (n − r)
conditional on it is of Poisson form, and that the mean of n is r + λ(1 − p).


(C.) 


7 By means of probability generators, calculate the expected value and 
variance of the number of throws of a normal unbiased die required to 
obtain the sequence 1 2 3 4 5 6 (not necessarily consecutively). 


8 Matches are put into a box five at a time until the mass of the box and
matches combined reaches M g when the box is said to be full. The mass
of an individual match is Normally distributed with mean m g and
standard deviation σ g. The mass of an empty box is Normally distributed
with mean 5m g and standard deviation 2σ g. Find the value of M such
that there is only one chance in a hundred that a full match-box contains
fewer than 50 matches. (C.S.)


9 Each of four players is dealt 13 cards from a pack of 52 which contains
4 aces. Player A looks at his hand and winks at his partner, Player B, which
is a pre-arranged signal that his hand has at least one ace. Player B winks
back to show that he has at least one ace as well. Player C looks at his hand
and sees that he has just one ace. From Player C’s point of view what is the
probability that his partner, Player D, has at least one ace if:

(i) he saw the winks and understood their meaning;

(ii) he knows nothing about his opponents’ signals. (C.S.)


10 R is the number of successes in a binomial distribution with
parameters n, p; S is the number of failures.

(i) Find the covariance of R and S.

(ii) Use the expectation and variance of (R − S), together with a
suitable approximation to the distribution of (R − S) when p = ½, to find λ
such that

p(|R − S| > λ) = 0.05.

(iii) Calculate the probability p(|R − S| > 8) when n = 16 and p = ½
to two significant figures, using the exact distribution. (M.E.I.)

11 (i) If A₁, A₂, A₃, ..., Aₙ are certain events, we know (from
chapter 7) that

$$p(A_1 + A_2) = p(A_1) + p(A_2) - p(A_1 A_2).$$

Find corresponding expressions for $p(A_1 + A_2 + A_3)$
and $p(A_1 + A_2 + \cdots + A_n)$.


(ii) n boys go to a riotous party, each with a girl; and though the party
ends in complete confusion, each boy also leaves with a girl. What is the
probability that no boy leaves with the girl he brought, and to what value
does this probability tend for a large party?


12 Initially a machine is in good running order but it is subsequently
liable to break down. As soon as a breakdown occurs repairs begin. If the
machine is in good order at time t then the probability that a breakdown
occurs in a small interval (t, t + δt) is α δt, and if it is under repair at
time t the probability that the repair is completed in time (t, t + δt) is
β δt. Let p(t) be the probability that the machine is under repair at time t.

Write down an equation relating p(t + δt) to p(t) and hence show that

$$p(t) = \frac{\alpha}{\alpha + \beta}\left\{1 - e^{-(\alpha + \beta)t}\right\}. \quad \text{(C.S.)}$$


13 Two jars, one white and the other black, contain a + b balls each; in
the white jar there are a white and b black balls and in the black jar b white
and a black. Single draws are made as follows: at the rth draw a ball is
drawn from the white or black jar according as the (r − 1)th ball drawn
was white or black, the colour of the ball noted, and then returned. If pₙ is
the probability that the nth ball is white, show that

$$(a + b)\,p_n = b + (a - b)\,p_{n-1}.$$


By means of the substitution 


$$p_n = \tfrac{1}{2} + q_n$$

or otherwise, determine pₙ, when the jar from which the first draw is made
is (i) chosen at random, (ii) white. (C.S.)


14 Three deadly enemies A, B, and C take part in a three-cornered duel
with pistols. The probability of A hitting his target is 0.6, that of B 0.8,
and C 1. They draw lots for the order of firing and each uses his best 
strategy (they all know one another’s capabilities). If they take it in turns 
to keep on firing until just one is left, who is most likely to survive, and what 
is his probability of survival? 
15 In a game between two players both players have an equal chance of
winning each point. The game continues until one player has scored N
points. Find the probability pᵣ that the winning player has a lead of
exactly r points when the game is complete. Deduce that


$$(2N - r - 1)\,p_{r+1} = 2(N - r)\,p_r,$$


and hence find the expected value of the lead at the end of the game. 


(C.S.)


Matrices, determinants, and 
linear algebra 


13.1 Introduction: matrices and matrix addition 
Matrices as stores of numerical information 


Suppose the boys in a school are placed in ‘sets’ for mathematics according
to ability, and that there are four sets in the first and second forms and five
sets thereafter. We use the notation III₂, for example, to denote the second
set of the third forms. If conditions were ‘ideal’ or extremely rigid, all
the boys in I₁ would move up the following year into set II₁; all those in
IV₃ would go into V₃, and so on. But of course, in practice, things are not
like that: if they were, life would be very dull! Some make unexpected
progress and earn promotion to higher sets; others do the reverse. Some
develop a dislike of the subject, whilst others acquire a fascination for it.
So it may be that, of the 31 boys in IV₂, 4 go into V₁ the next year, 20 into
V₂, 5 into V₃ and 2 into V₄, none into V₅. Suppose we wished to give an
account of the flow of boys through the school. One way would be to
describe the composition of the various sets in words: ‘of the 31 boys in II₁,
18 went into III₁, 6 into III₂, 4 into III₃, none into III₄ and 1 into the
lowest set, III₅, while 2 boys left to join other schools, and so on...’.

One diagrammatic way of representing this data might be as follows: 


[Diagram: arrows showing the numbers of boys moving from last year’s sets and from new entrants into sets III₁, III₂, III₃, III₄, III₅, or leaving]
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A similar diagram showing the ‘flow’ between I’s and II’s, III’s and IV’s, 
and between IV’s and V’s would give some sort of a picture of the ‘flow’ 
of boys through the school. Though this may be an improvement on the 
dry prose of the descriptive paragraph, in that one can ‘see’ the movement 
by following the various arrows, it may be objected that the representation 
is somewhat lacking in clarity! 

What is the best way of depicting this information? Surely by means of a 
two-dimensional array, or table, or matrix, as follows: 


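                       From
               II₁   II₂   II₃   II₄   New

      III₁      18     2     7     2     0
      III₂       6    10     8     3     2
To    III₃       4     9     8     6     0
      III₄       0     4    11    10     1
      III₅       1     0     1    12     1
      Leave      2     0     1     0     −


This is usually written

$$\begin{pmatrix}
18 & 2 & 7 & 2 & 0 \\
6 & 10 & 8 & 3 & 2 \\
4 & 9 & 8 & 6 & 0 \\
0 & 4 & 11 & 10 & 1 \\
1 & 0 & 1 & 12 & 1 \\
2 & 0 & 1 & 0 & 0
\end{pmatrix}$$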


and, as it has six rows and five columns, is called a 6 × 5 matrix.


This method of displaying the data can scarcely be improved. Each row
of the matrix shows clearly the composition of the new III’s sets, e.g., that
set III₄ consists of 4 boys who were in last year’s II₂, 11 from II₃, 10 from
II₄, and 1 new boy. The row, or ‘list’, (0, 4, 11, 10, 1) may, in fact, be
regarded as a row vector, and the 6 × 5 matrix as a collection of six such
row vectors. Similarly the columns show the distribution of the boys in the
old II’s; the final column vector

$$\begin{pmatrix} 0 \\ 2 \\ 0 \\ 1 \\ 1 \\ 0 \end{pmatrix}$$

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is a list indicating the disposition of the four new boys between the five sets 
(the final entry may be deemed to be zero, since a new boy is hardly likely 
to leave before even arriving — though it has been known!). 

A large school with an intake of, say, 300 pupils per year would require 
larger matrices (say 12 x 12) to describe the composition of the new sets. 
A small school with 3 sets per year may possibly have a policy of moving 
boys only in exceptional circumstances, and it could be that a ‘ promotion 
matrix’ for one level would read: 


28 | 0 
as a | 
O 0 27 


Notice that all such matrices are nothing more than stores of numerical
information (or just tables, if you like) and you can imagine how such systems of
matrices are useful for describing the stages in an industrial process.


Matrix addition 


We can now show how matrix addition may be defined. Suppose five boys, 
Don, Joe, Ken, Tom and Vic tabulated their marks in maths (M), physics 
(P), chemistry (C), and biology (B) during each of two exams, as follows: 


        1st exam              2nd exam


     M    P    C    B      M    P    C    B


Then the following table would give their aggregate scores (out of a
total of 200)
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We may write this in matrix form as the addition of the two 5 x 4 matrices 
to give a single 5 x 4 matrix, thus: 


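$$\begin{pmatrix}
55 & 42 & 71 & 60 \\
35 & 62 & 29 & 73 \\
69 & 48 & 40 & 57 \\
24 & 38 & 47 & 49 \\
70 & 68 & 67 & 85
\end{pmatrix}
+
\begin{pmatrix}
70 & 56 & 82 & 57 \\
46 & 58 & 34 & 68 \\
73 & 55 & 52 & 78 \\
29 & 32 & 47 & 58 \\
76 & 63 & 61 & 83
\end{pmatrix}
=
\begin{pmatrix}
125 & 98 & 153 & 117 \\
81 & 120 & 63 & 141 \\
142 & 103 & 92 & 135 \\
53 & 70 & 94 & 107 \\
146 & 131 & 128 & 168
\end{pmatrix}$$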


More generally, we may define the sum of two p × q matrices, where
essentially each has the same number of rows (p) and the same number of
columns (q), by asserting that the elements in the ith row and jth column
of the two given matrices are added to give the element of the ith row and
jth column of the sum matrix. Again, for example, when p = 2, q = 5,
we have

$$\begin{pmatrix}
a_1 & a_2 & a_3 & a_4 & a_5 \\
b_1 & b_2 & b_3 & b_4 & b_5
\end{pmatrix}
+
\begin{pmatrix}
c_1 & c_2 & c_3 & c_4 & c_5 \\
d_1 & d_2 & d_3 & d_4 & d_5
\end{pmatrix}
=
\begin{pmatrix}
a_1 + c_1 & a_2 + c_2 & a_3 + c_3 & a_4 + c_4 & a_5 + c_5 \\
b_1 + d_1 & b_2 + d_2 & b_3 + d_3 & b_4 + d_4 & b_5 + d_5
\end{pmatrix}$$


Similarly, as in the previous example, one may possibly wish to add a
number of ‘promotion matrices’; if, for instance, one wished to find the
aggregate movement of pupils from II’s to III’s over a period of ten years,
one could add those ten 6 × 5 matrices to give a single 6 × 5 matrix.
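
The definition translates directly into code. The following Python sketch adds two matrices stored as lists of rows; `add_matrices` is a hypothetical helper of our own, and the data are the two 5 × 4 matrices of examination marks as read from the worked example above.

```python
def add_matrices(a, b):
    """Add two matrices (lists of rows) element by element.

    Both matrices must have the same number of rows and of columns.
    """
    if len(a) != len(b) or any(len(r) != len(s) for r, s in zip(a, b)):
        raise ValueError("matrices must have the same shape")
    return [[x + y for x, y in zip(r, s)] for r, s in zip(a, b)]


# Rows: Don, Joe, Ken, Tom, Vic; columns: M, P, C, B.
first_exam = [
    [55, 42, 71, 60],
    [35, 62, 29, 73],
    [69, 48, 40, 57],
    [24, 38, 47, 49],
    [70, 68, 67, 85],
]
second_exam = [
    [70, 56, 82, 57],
    [46, 58, 34, 68],
    [73, 55, 52, 78],
    [29, 32, 47, 58],
    [76, 63, 61, 83],
]

aggregate = add_matrices(first_exam, second_exam)
for row in aggregate:
    print(row)
```

The shape check reflects the requirement that both matrices be p × q; adding a 5 × 4 matrix to a 6 × 5 matrix is simply not defined.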

An ordinary football league table may also be regarded as a matrix, the 
rows corresponding to the various teams A, B, C,..., and the columns 
giving the number of games played (P), won (W), drawn (D), lost (L), 
goals for (F), against (A), and total points (T): 


      P    W    D    L    F    A    T
A    12    9    2    1   31   10   20
B    11    8    3    0   28   12   19
C    12    8    1    3   30   15   17


The result of a particular week’s play might be represented as follows: 


      1    0    1    0    3    3    1
      1    0    0    1    1    4    0
      0    0    0    0    0    0    0


and the table


     13    9    3    1   34   13   21
     12    8    3    1   29   16   19    †
     12    8    1    3   30   15   17


informing us of the final state of the teams, is derived from the first two by
matrix addition.

Note that each row of the matrix is a list representing the performance of 
one particular team; while the column vectors are lists providing statistics 
about one aspect of all the teams’ performances, e.g., column (D) gives 
the whole story of the league from the point of view of matches drawn. 


Exercise 13.1a 


1 The following is a matrix showing the expenditure in £ of a house on 
types of fuel during consecutive quarters in 1973: 


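Quarter | Oil   Electric   Solid fuel


$$\begin{pmatrix}
36 & 13 & 14 \\
23 & 18 & 58 \\
20 & 21 & 2 \\
41 & 15 & 8
\end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix}
45 & 18 & 7 \\
15 & 14 & 0 \\
18 & 19 & 0 \\
30 & 22 & 0
\end{pmatrix}$$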


Sum the three matrices and interpret. 


2 Five bowlers had the following figures at the end of three test matches: 


† Of course, if the final table is also to show the teams in ‘order of merit’, there will usually
have to be a reshuffling of the rows, and this could be effected by ‘pre-multiplying’ by a
so-called ‘permutation matrix’.
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Bowler | Overs Maidens Runs Wickets 


The performance at the end of the series of five matches was given by the
5 × 4 matrix:

$$\begin{pmatrix}
213 & 41 & 418 & 32 \\
89 & 11 & 305 & 18 \\
80 & 12 & 198 & 11 \\
25 & 2 & 64 & 5 \\
37 & 14 & 90 & 8
\end{pmatrix}$$

Set up a matrix showing the performance of the five bowlers over the last
two games.


3 The matrix below shows the readings of barometer, thermometer, rain- 
fall, windspeed and wind direction at three stations A, B, C: 


Barometer Thermometer Rainfall Wind Speed Direction 


Changes in the next 24 hours are recorded in the following matrix: 
20: ἘΞ 3: ρὸν ΕἼ. cae 5 


+38 -ἡ +21 =—11 +20 
—l1 0 Oo 22 529 


Record the final state of the weather at A, B, C as a 3 × 5 matrix.


4 If A, B, C are p × q matrices, and + denotes matrix addition, prove
that

(A + B) + C = A + (B + C)

(the associative law of addition for matrices).
5 A is a p × q matrix. Define −A, the negative of A, as the matrix A with
every element multiplied by −1. Proceed to define matrix subtraction,
and evaluate


ae 7 2. oe ed 
2 0 ᾿Ξ} —9 l 4 


Linear dependence and independence: preliminary discussion 


It is worth remarking here that some of these column vectors are not inde- 
pendent of each other. For example, in the above table of football results: 


col. (P) = col. (W) + col. (D) + col. (L)

$$\begin{pmatrix} 12 \\ 11 \\ 12 \end{pmatrix}
= \begin{pmatrix} 9 \\ 8 \\ 8 \end{pmatrix}
+ \begin{pmatrix} 2 \\ 3 \\ 1 \end{pmatrix}
+ \begin{pmatrix} 1 \\ 0 \\ 3 \end{pmatrix}$$

Using the notation p to represent column vector (P), we might write:

p = w + d + l    or    l = p − w − d, etc.    (1)

The fact that the column vectors are not independent means that there is a
sense in which one of them is redundant: if we know any three of p, w, d,
l, then the other may be found from the vector equation (1). Again, we
have t = 2w + d; the three columns (T), (W), (D) are said to be ‘linearly
dependent’.
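
These relations between the columns of the league table can be verified mechanically. In the sketch below the columns are plain Python lists and `linear_combination` is a hypothetical helper of our own; the figures are those of the table for teams A, B, C.

```python
def linear_combination(coefficients, vectors):
    """Return c1*v1 + c2*v2 + ... for vectors of equal length."""
    length = len(vectors[0])
    return [
        sum(c * v[i] for c, v in zip(coefficients, vectors))
        for i in range(length)
    ]


# Columns of the league table for teams A, B, C.
p = [12, 11, 12]  # played
w = [9, 8, 8]     # won
d = [2, 3, 1]     # drawn
l = [1, 0, 3]     # lost
t = [20, 19, 17]  # total points

print(linear_combination([1, 1, 1], [w, d, l]) == p)   # p = w + d + l
print(linear_combination([1, -1, -1], [p, w, d]) == l)  # l = p - w - d
print(linear_combination([2, 1], [w, d]) == t)          # t = 2w + d
```

Each comparison prints `True`, confirming that the columns concerned are linearly dependent.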

It may be observed that columns (F) and (A) are not independent, since 
their totals must be equal. Even so, it would not be correct to describe 
them as a pair of linearly dependent columns. This would only happen when 
a linear relation, of similar form to (1), connected the columns, and this must 
mean, in the case of two columns, that one would have to be a multiple of 
the other. For example, if there were an extra column for ‘points from
matches won’:

$$\mathbf{x} = \begin{pmatrix} 18 \\ 16 \\ 16 \end{pmatrix}$$

then clearly x = 2w, the two columns having corresponding entries in the
fixed ratio 2:1.
Again, suppose there was a column (G) for ‘goal average’ 
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= goals for/goals against: 
$$\mathbf{g} = \begin{pmatrix} 3.1 \\ 2.33 \\ 2.0 \end{pmatrix}$$

This column depends very much on columns (F) and (A), but it is by no
means linearly dependent upon them: we cannot find values of α and β
such that g = αf + βa.

Indeed, even the first three equations

$$3.1 = 31\alpha + 10\beta$$
$$2\tfrac{1}{3} = 28\alpha + 12\beta$$
$$2 = 30\alpha + 15\beta$$

are inconsistent.

The relations that do exist are g₁ = f₁/a₁, g₂ = f₂/a₂, etc., but these
are not what is needed for linear dependence.
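
The inconsistency can be confirmed with exact arithmetic. The Python sketch below solves the first and third equations for α and β and then tests the second; `solve_pair` is a hypothetical helper of our own, and the goal figures are those of teams A, B and C in the league table.

```python
from fractions import Fraction


def solve_pair(f1, a1, g1, f2, a2, g2):
    """Solve f1*alpha + a1*beta = g1 and f2*alpha + a2*beta = g2 exactly."""
    det = f1 * a2 - f2 * a1
    if det == 0:
        raise ValueError("the two equations do not determine alpha and beta")
    alpha = (g1 * a2 - g2 * a1) / det
    beta = (f1 * g2 - f2 * g1) / det
    return alpha, beta


goals_for = [Fraction(31), Fraction(28), Fraction(30)]
goals_against = [Fraction(10), Fraction(12), Fraction(15)]
goal_average = [f / a for f, a in zip(goals_for, goals_against)]

# Use the equations for teams A and C ...
alpha, beta = solve_pair(
    goals_for[0], goals_against[0], goal_average[0],
    goals_for[2], goals_against[2], goal_average[2],
)
# ... and see what they would predict for team B.
predicted_b = alpha * goals_for[1] + beta * goals_against[1]

print(alpha, beta)
print(predicted_b, goal_average[1])
```

The predicted value for B differs from the true goal average 28/12, so no single pair α, β satisfies all three equations.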

Note that ‘vector addition’ such as we have encountered in this
paragraph occurs in many other contexts. For example, in adding polynomials,
we are carrying out a similar process in adding like terms:


          3x³ − 2x²       − 11
Add             5x² − 7x −  4

          3x³ + 3x² − 7x − 15


This might be written

(3  −2  0  −11) + (0  5  −7  −4) = (3  3  −7  −15),


each polynomial being represented by a row vector or ‘list’ of coefficients. 
The idea of linear independence and linear dependence also arises when 
we consider the solution of linear equations. The pair of equations: 


$$2x - 5y - 4 = 0$$
$$-10x + 25y + 20 = 0$$

is a perfectly good pair of simultaneous equations, but they do not have a
unique solution. They have an infinity of solutions for the simple reason
that the second is simply a multiple of the first, and is in effect redundant,
the matrix of coefficients

$$\begin{pmatrix} 2 & -5 & -4 \\ -10 & 25 & 20 \end{pmatrix}$$

consisting of two linearly dependent rows.
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On the other hand, the equally respectable pair of linear equations 
$$2x - 5y - 4 = 0$$
$$-10x + 25y + 9 = 0$$

do not have any solution for the unknowns x, y. Here the first two columns
$\begin{pmatrix} 2 \\ -10 \end{pmatrix}$ and $\begin{pmatrix} -5 \\ 25 \end{pmatrix}$
are linearly dependent, but not the rows. We shall
return to a fuller consideration of this (see pp. 442, 445).
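
The three possibilities for a pair of linear equations ax + by + c = 0 can be distinguished by testing for proportionality, as in the following Python sketch. The function `classify_pair` is our own illustrative helper, and it assumes that each equation contains at least one non-zero coefficient of x or y.

```python
def classify_pair(eq1, eq2):
    """Classify a pair of equations a*x + b*y + c = 0, given as (a, b, c)."""
    a1, b1, c1 = eq1
    a2, b2, c2 = eq2
    if a1 * b2 - a2 * b1 != 0:
        # The coefficient columns are linearly independent.
        return "unique solution"
    # The columns are dependent; are the complete rows dependent too?
    if a1 * c2 - a2 * c1 == 0 and b1 * c2 - b2 * c1 == 0:
        return "infinitely many solutions"
    return "no solution"


print(classify_pair((2, -5, -4), (-10, 25, 20)))
print(classify_pair((2, -5, -4), (-10, 25, 9)))
print(classify_pair((1, 1, -2), (1, -1, 0)))
```

The first pair has linearly dependent rows and so an infinity of solutions; the second has dependent coefficient columns but independent rows, and so no solution; the third pair (a hypothetical example) has the unique solution x = y = 1.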


Exercise 13.1b 


1 Three batsmen have the following analyses. Fill in the blanks, and state 
which rows and which columns are linearly dependent. 


Innings    Not out    Runs    100s    Completed innings    Average


2 A boy records the number of ice creams, buns and packets of crisps in
columns a, c, and e during four successive weeks, and the money spent on
them (prices 3p, 5p, and 2p respectively) in columns b, d, f. The total
expenditure in p is shown in g, while h shows the change from £1.


Week      Ice-cream       Buns           Crisps        Total    Change
          No.   Cost      No.   Cost     No.   Cost
           a     b         c     d        e     f        g        h


Totals    12    36        25    125      25    50       211      189


State which of the columns and which of the rows are linearly dependent 
(taking them in pairs, in threes, etc.). 


3 Decide which of the following row vectors are linearly dependent (in 
twos and in threes) : 

a = (2, 5, −3)     b = (6, 15, −9)     c = (4, −10, −6)

a= (=2,5,5) e = (6, --ὃ, -9) f= (0, 30, 0) 
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ae) 

6- = 2 
dependent. Replace one element in row 2 so that the rows may be linearly 
dependent. 


πον. 
4 In the matrix b decide which columns are linearly 


5 Examine the matrix

$$\begin{pmatrix}
2 & 3 & 1 & -4 \\
-1 & 2 & 0 & 2 \\
3 & 1 & 1 & -6
\end{pmatrix}$$

for linear dependence of rows and of columns in pairs and in threes.


6 Find x so that the three rows may be linearly dependent:
—3 4 0 
8 DO, ΞΞ2 
x 35] 6 
Are the columns then linearly dependent or not? 
7 The atomic weights of C, O, and H are 12, 16 and 1 respectively. The
matrix shows the numbers of atoms of each element in various compounds,
and the molecular weight in the final column.


Compound        C     H     O    Molecular weight
Water           0     2     1     18
Methane         1     4     0     16
Benzene         6     6     0     78
Alcohol         2     6     1     46
Sugar          12    22    11    342
Acetic acid     2     4     2     60


Show that the row vectors for water, sugar and acetic acid are linearly 
dependent. 


13.2 Linear transformations and their matrices 
Transformations in two dimensions 


It will be remembered from our discussion in the prologue to Book 1 that
linear transformations† in the plane were governed by equations of the form

$$x' = ax + by$$
$$y' = cx + dy$$


† Confining ourselves to those in which the origin is invariant (i.e., excluding translations).
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which we expressed as a matrix equation 


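$$\begin{pmatrix} x' \\ y' \end{pmatrix}
= \begin{pmatrix} a & b \\ c & d \end{pmatrix}
\begin{pmatrix} x \\ y \end{pmatrix}.$$

These may be written more briefly as x′ = Mx, where x, x′ are the vectors
$\begin{pmatrix} x \\ y \end{pmatrix}$, $\begin{pmatrix} x' \\ y' \end{pmatrix}$
(or 2 × 1 matrices, with 2 rows and 1 column),
and M is the 2 × 2 matrix $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$.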


The effect of this transformation is to take the grid of unit squares 
in the x, y plane into a grid of parallelograms in the x’, y’ plane, so that the 
unit square OUKV transforms into the parallelogram OU'K’'V’, 


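the area of the basic parallelogram cell being ad − bc. (In the ‘singular
case’, when ad − bc = 0, the network of parallelograms collapses into a
straight line.) The salient results are that the first column of the 2 × 2 matrix,
$\begin{pmatrix} a \\ c \end{pmatrix}$, gives the transform U′ of the point U
$\begin{pmatrix} 1 \\ 0 \end{pmatrix}$, for

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix}
\begin{pmatrix} 1 \\ 0 \end{pmatrix}
= \begin{pmatrix} a \\ c \end{pmatrix},$$

while the second column $\begin{pmatrix} b \\ d \end{pmatrix}$ gives the coordinates
of V′, the transform of V $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$, since

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix}
\begin{pmatrix} 0 \\ 1 \end{pmatrix}
= \begin{pmatrix} b \\ d \end{pmatrix}.$$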
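
The column property is easily illustrated numerically. In the Python sketch below, `transform` and `determinant` are hypothetical helper names of our own, and the matrix M is an arbitrary example rather than one taken from the text.

```python
def transform(m, v):
    """Apply the 2 x 2 matrix m = ((a, b), (c, d)) to the vector v = (x, y)."""
    (a, b), (c, d) = m
    x, y = v
    return (a * x + b * y, c * x + d * y)


def determinant(m):
    """ad - bc, the area of the image of the unit square (with sign)."""
    (a, b), (c, d) = m
    return a * d - b * c


M = ((2, 1), (1, 3))  # an arbitrary illustrative matrix

print(transform(M, (1, 0)))  # image U' of U: the first column of M
print(transform(M, (0, 1)))  # image V' of V: the second column of M
print(transform(M, (1, 1)))  # image K' of K: the sum of the two columns
print(determinant(M))        # area of parallelogram OU'K'V'
```

Here U′ = (2, 1) and V′ = (1, 3) are the columns of M, K′ = (3, 4) is their sum, and the parallelogram OU′K′V′ has area 5.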


Using this important property we are able to write down at sight the 
2 x 2 matrices which represent such transformations as rotations about 
the origin, reflections in a line through O, shears, enlargements, and so on. 

For example, suppose we wish to reflect in the line y = x tan θ. We
see that the image of U is the point with coordinates (cos 2θ, sin 2θ). It is
not quite so easy to ‘read off’ the coordinates of V′, but remembering that
OU′ ⊥ OV′, it is clear that V′ is the point (sin 2θ, −cos 2θ).

Thus the required matrix is

$$\begin{pmatrix} \cos 2\theta & \sin 2\theta \\ \sin 2\theta & -\cos 2\theta \end{pmatrix}.$$

Notice that its determinant (ad − bc) is −cos² 2θ − sin² 2θ, and so has
the value −1: this numerical value shows that areas are preserved, while
the minus sign signifies that the transformation is ‘opposite’, i.e., the
reflection has turned the triangle OUV over, the anti-clockwise sense of
the lettering OUV being replaced by the clockwise sense of OU′V′.
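
A short numerical check of the reflection matrix may be helpful. The Python sketch below builds the matrix for a given θ and confirms that its determinant is −1 and that reflecting twice restores the original point; the function names and the test values are our own choices.

```python
from math import cos, sin, radians, isclose


def reflection_matrix(theta):
    """Matrix of the reflection in the line y = x tan(theta)."""
    c, s = cos(2 * theta), sin(2 * theta)
    return ((c, s), (s, -c))


def transform(m, v):
    """Apply the 2 x 2 matrix m to the vector v = (x, y)."""
    (a, b), (c, d) = m
    x, y = v
    return (a * x + b * y, c * x + d * y)


def determinant(m):
    (a, b), (c, d) = m
    return a * d - b * c


M = reflection_matrix(radians(30))   # reflection in the line y = x tan 30°
point = (3.0, -2.0)                  # an arbitrary point
image = transform(M, point)
back = transform(M, image)           # reflecting a second time

print(determinant(M))
print(image)
print(back)
```

With θ = 45° the matrix reduces to the interchange of x and y, i.e., reflection in the line y = x.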


Exercise 13.2a 


1 Write down the matrices representing the following transformations: 
(1) Rotation about O through π, ἐπ, --ἰπ, 31, --ἰπ, 2n/3, —2n/3, 37/4, 
— 32/4, tan” ' (3/4), etc. 

(ii) Reflections in the lines x = 0, y = 0, x = y, x = −y, x = 2y.

(iii) An enlargement in O with ratio 3:1; an x-stretch, ratio 2:1; a
y-stretch, ratio 1:3; a y-shear which moves the point (1, 1) to the point (1, 0).


2 What transformations are represented by the following matrices: 


. [2 9 ih Hao 0 τὰ 95. 4 ᾿ 1 ὦ 
6 ἢ ὡς 3. w(2) ωρ ἢ 


ΜΝ COS ἃ sin α --ηα cosa 
(viii) (“ (ix) wa 
sina —cosa/, cosa sing 


cos 40° —sin 40° ἢ 2 4 ἢ 1 0 ; 
BRN gs: ρελ} NG oe Aang 


3 Describe the transformations represented by the eight matrices 


+1 0 Oe 1 
0 +1)’ +1 Of} 
4 In nos. 2 and 3 above, check the effects of the transformations on areas,
and compare the values of the determinant ad − bc in each case.
5 Obtain the matrix for rotation about the origin through α
(i) by obtaining x' and y' in terms of x and y by projection;
(ii) by using complex numbers, starting from the equation z' = z cis α;
(iii) by the method shown in the text above.


6 Obtain the matrix for reflection in the line y = x tan θ, by first rotating
the coordinate axes so that the x-axis coincides with the reflection axis
(operation R), then performing the reflection in this axis, and finally
restoring the axes to their original position ($R^{-1}$).


2 =] 0 
7 A triangle has vertices (ἢ) a( ᾿ o( ᾿ 


What is its image under the transformations represented by the matrices: 


ae By ee OWN. Gan 4-θ 
(ἡ ba a μὴ ( " ii) ( a) 


8 Investigate the effect of the transformation represented by on 


1 1 
0\ (0 ] ] 
the rectangle with vertices ( 7 "ἢ ( ( ᾿ 


9 
9 What is the image of the point 9 under the matrix which moves 


Ὁ)" [ὦ (i) 50} 


2:4 


Show that M = 


operating on any vector in the direction 


zs te La —3 
ἢ leaves it invariant. Show that vectors in the direction ( ; are also 


unchanged in direction, but are enlarged by a factor 5. Repeat, using the 
matrix M?. 


: : ; 0 ] 
11 Finda2 xX 2 matrix which transforms the vector () to the vector ( 


. 
and leaves ()) invariant. What type of transformation is it? 


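12 Show that, if a 2 x 2 matrix represents an isometry, i.e., a distance-preserving
transformation, then it must be either of the form

\[ \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \]

or else of the form

\[ \begin{pmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{pmatrix}, \]

i.e., a rotation or a reflection.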
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Transformations in three dimensions 


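All the above can be extended to a higher number of dimensions. In three
dimensions, for instance, we can consider the equations

\[ \begin{aligned} x' &= a_1 x + b_1 y + c_1 z \\ y' &= a_2 x + b_2 y + c_2 z \\ z' &= a_3 x + b_3 y + c_3 z \end{aligned} \]

as representing a transformation which maps the vector
$\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ into the vector $\begin{pmatrix} x' \\ y' \\ z' \end{pmatrix}$.

Just as in the two-dimensional case, this linear transformation which
preserves the origin may be completely described by a matrix, in this case
the 3 x 3 matrix

\[ M = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}, \]

so that the transformation may be written

\[ \begin{pmatrix} x' \\ y' \\ z' \end{pmatrix} = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix}, \]

or more briefly as x' = Mx.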
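If we label the points (1, 0, 0), (0, 1, 0) and (0, 0, 1) as U, V, W respectively,
the transform of the point U will be given by

\[ \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}, \]

i.e., the coordinates of the point U', the image of U, are the first column† of
the matrix. Similarly, the second and third columns of the matrix give the
coordinates of V' and W':

\[ \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} b_1 \\ b_2 \\ b_3 \end{pmatrix}, \qquad \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix} = \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}. \]


† It might therefore seem that columns are more important than rows. But this is only because
we have chosen to represent vectors by columns. Had we used row vectors, and post-multiplied
by the transforming matrix, its rows would have given the images of U, V, W.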




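Thus the images of the vectors

\[ \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix} \]

are given by the columns of the matrix

\[ \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}. \]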

In this sense, the 3 x 3 matrix may be thought of as a collection of three 
column vectors a, b, ec. Conversely, if we wish to write down the nine ele- 
ments of the matrix which represents a particular three-dimensional 
transformation of the type described, all we have to do is to find the images 
of the three points U, V, W, and their coordinates will give the three 
columns of the required matrix. 

The cube having OU, OV and OW as adjacent edges is taken by the 
above transformation into a parallelepiped having OU’, OV’, OW’ as adja- 
cent edges, and the effect of the transformation on the lattice (i.e., solid 
network) of unit cubes will be to take them into a lattice of parallelepipeds. 
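
The rule that the columns of the matrix are the images of U, V, W can be tried out directly. This is only a sketch: the helper names and the particular images are hypothetical choices made for illustration.

```python
def mat_vec(m, v):
    """Multiply a 3 x 3 matrix (list of rows) by a column vector (list)."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]

def matrix_from_images(u_img, v_img, w_img):
    """Build the matrix whose columns are the images U', V', W'."""
    return [[u_img[i], v_img[i], w_img[i]] for i in range(3)]

# Hypothetical images, chosen only for illustration: U -> V, V -> W, W -> U.
M = matrix_from_images([0, 1, 0], [0, 0, 1], [1, 0, 0])

U, V, W = [1, 0, 0], [0, 1, 0], [0, 0, 1]
print(M)
print(mat_vec(M, U), mat_vec(M, V), mat_vec(M, W))
```

Multiplying the matrix by U, V and W in turn simply picks out its first, second and third columns.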


Effect of linear transformations on volumes 


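We have seen that the linear transformation whose matrix is

\[ M = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} \]

transforms the unit vectors

\[ \mathbf{i} = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad \mathbf{j} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad \mathbf{k} = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix} \]

into

\[ \mathbf{a} = \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}, \quad \mathbf{b} = \begin{pmatrix} b_1 \\ b_2 \\ b_3 \end{pmatrix}, \quad \mathbf{c} = \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}. \]

But the volume of the parallelepiped formed by a, b, c has been shown
(in 11.5) to be the scalar triple product

\[ \mathbf{a} \cdot (\mathbf{b} \times \mathbf{c}) = a_1 b_2 c_3 - a_1 b_3 c_2 + b_1 c_2 a_3 - b_1 c_3 a_2 + c_1 a_2 b_3 - c_1 a_3 b_2, \]

which is called the determinant† of the matrix M and written
Δ = |M| (or det M).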


† Just as the area ad − bc of the basic parallelogram in the two-dimensional case was the
determinant of $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$.
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So the transformation whose matrix is M leads to volumes being altered 
by a factor |M|.


Determinants will be discussed more fully in 13.5 and 13.6, but at this
stage we simply note that when the determinant is zero the parallelepiped
has zero volume, i.e., the three vectors are coplanar:

\[ |M| = 0 \iff \mathbf{a}, \mathbf{b}, \mathbf{c} \text{ are coplanar.} \]

In this case, a vector $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ in the original space will now be transformed
into a vector

\[ x\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} + y\begin{pmatrix} b_1 \\ b_2 \\ b_3 \end{pmatrix} + z\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = x\mathbf{a} + y\mathbf{b} + z\mathbf{c}, \]

which lies in the plane of the vectors a, b, c. So the entire original three-
dimensional space is mapped onto a plane, a two-dimensional space; or,
in exceptional cases, onto a line or even a point.
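
The volume interpretation can be explored with a short computation. The sketch below evaluates the determinant as the scalar triple product of the columns; the function names and the two sample matrices are assumptions made for illustration only.

```python
def cross(b, c):
    """Vector product b x c of two 3-vectors."""
    return [b[1] * c[2] - b[2] * c[1],
            b[2] * c[0] - b[0] * c[2],
            b[0] * c[1] - b[1] * c[0]]

def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))

def det3(m):
    """Determinant of a 3 x 3 matrix as a.(b x c), where a, b, c are its columns."""
    a, b, c = ([m[i][j] for i in range(3)] for j in range(3))
    return dot(a, cross(b, c))

stretch = [[3, 0, 0], [0, 1, 0], [0, 0, 1]]   # an x-stretch: volumes trebled
flat = [[1, 0, 1], [0, 1, 1], [0, 0, 0]]      # third column = first + second (coplanar)

print(det3(stretch), det3(flat))
```

The second matrix sends every vector into the plane z = 0, which is consistent with its zero determinant.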


Examples of three-dimensional transformations 


Let us set up the 3 x 3 matrices for transformations in a number of simple 
cases. 

First consider the reflection in the plane UOV (z = 0). The points
U and V are invariant, and the point W clearly goes to W' (0, 0, −1).
Hence the matrix is

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix}. \]


Next we consider a quarter-turn about the y-axis, the sense being such
as to take the point W into the point U, i.e., W' is (1, 0, 0). We see that
U' has coordinates (0, 0, −1) and since V is invariant, the required
matrix is

\[ \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ -1 & 0 & 0 \end{pmatrix}. \]


Consider next the matrix

\[ \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}. \]

From the columns, we can read off the positions of U' (0, 1, 0) = V,
V' (0, 0, 1) = W, and W' (1, 0, 0) = U. The diagram makes it evident
that we are concerned with a rotation through 120° about an axis passing
through (1, 1, 1). We may check that the vector $\begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix}$ is invariant under this
rotation, for

\[ \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}\begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix}. \]

[An important idea is suggested here, that of an eigenvector, whose direction
is unchanged in a linear transformation.]
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All the matrices so far considered have included only 0 and 1, but 

\[ \begin{pmatrix} 3 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \]

represents an x-stretch in the ratio 3:1, while

\[ \begin{pmatrix} 3 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 3 \end{pmatrix} \]

is an enlargement in the ratio 3:1, the vector OP being replaced by
OP' = 3OP. This matrix is known as a 'scalar' matrix, since its effect on
every vector OP is the same as the effect of multiplication by the scalar
number 3. The general scalar matrix

\[ \begin{pmatrix} k & 0 & 0 \\ 0 & k & 0 \\ 0 & 0 & k \end{pmatrix} \]

represents a pure enlargement by a factor k when k > 0, but when k < 0
it involves 'reflection through the origin', or 'central inversion', exemplified
by the matrix

\[ \begin{pmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \end{pmatrix}, \]

whereby every vector is reversed in direction.

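Lastly, we consider a transformation in which the origin is not fixed.
Under a transformation of the form x' = Mx, the origin maps into itself.
How can one express in matrix form a transformation which does not
possess this property? The simplest example might be a translation represented
by x' = x + a, where a is a fixed vector. In the two-dimensional
case, we have

\[ \begin{aligned} x' &= x + a \\ y' &= y + b \end{aligned} \]

and this may be written

\[ \begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & a \\ 0 & 1 & b \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} x \\ y \\ 1 \end{pmatrix}, \quad \text{or as} \quad \begin{pmatrix} \mathbf{x}' \\ 1 \end{pmatrix} = \begin{pmatrix} I & \mathbf{a} \\ \mathbf{0} & 1 \end{pmatrix}\begin{pmatrix} \mathbf{x} \\ 1 \end{pmatrix}. \]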
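
The device of adding a third coordinate equal to 1 can be sketched in a few lines. The function names and the numerical shift (2, −5) below are illustrative assumptions, not taken from the text.

```python
def translation_matrix(a, b):
    """3 x 3 matrix representing the plane translation x' = x + a, y' = y + b."""
    return [[1, 0, a],
            [0, 1, b],
            [0, 0, 1]]

def apply_plane(m, x, y):
    """Apply a 3 x 3 matrix to the plane point (x, y), written as the column (x, y, 1)."""
    col = [x, y, 1]
    xp, yp, _ = (sum(m[i][j] * col[j] for j in range(3)) for i in range(3))
    return (xp, yp)

T = translation_matrix(2, -5)      # illustrative shift
print(apply_plane(T, 0, 0))        # the origin is no longer fixed
print(apply_plane(T, 1, 1))
```

The third row (0 0 1) ensures that the extra coordinate stays equal to 1, so that such matrices can be multiplied together in the usual way.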
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Exercise 13.2b 


1 Write down the 3 x 3 matrices representing the following:
(i) reflection in the plane y = 0;
(ii) reflection in the plane y = x;
(iii) reflection in the plane y = x tan θ;
(iv) reflection in the plane x + y + z = 0;
(v) half-turn about the x-axis;
(vi) half-turn about the line x = y, z = 0;
(vii) half-turn about the line x = y = z;
(viii) quarter-turns about the z-axis;
(ix) quarter-turns about the line x = 0 = y + z;
(x) rotation through θ about the y-axis;
(xi) an enlargement from the origin in the ratio 9:1;
(xii) an orthogonal projection on to the plane y = 0.


2 What transformations are represented by the following matrices: 


1 Ol 1 0 0 0 10 0 0 20 0 
O-1 0]. [01 OF}, [0 0 1!Π, [0 0 -Ἰ]. [10 95 0]. 
0 0 1 0 0 0 00 O 1 OQ 0 0-1 
—-3 0 0 —0.6 1 O 0 

0-3 Of, 0.8 0 0 0.6 0.81} 

0. τ) 599 0 0.8 —0.6 


3 Consider the effects of the above transformations on volumes and check 
by evaluating the determinants in each case. 


4 Show that the matrix $\begin{pmatrix} 6 & -4 & 3 \\ 29 & -20 & 13 \\ 33 & -23 & 14 \end{pmatrix}$ transforms the vector $\begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix}$ into
itself.
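5 Check that the vectors $\begin{pmatrix} 2 \\ 1 \\ 0 \end{pmatrix}$, $\begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}$, $\begin{pmatrix} 0 \\ -1 \\ 2 \end{pmatrix}$ are coplanar, and investigate
the effect of the matrix

\[ \begin{pmatrix} 2 & 1 & 0 \\ 1 & 0 & -1 \\ 0 & 1 & 2 \end{pmatrix}. \]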

6 Find the effect on the cube with vertices $\begin{pmatrix} \pm 1 \\ \pm 1 \\ \pm 1 \end{pmatrix}$ of the matrix

\[ \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix}. \]


Repeat for some of the matrices mentioned in nos. 1, 2, 5 above. 


7 Check that, if a 3 x 3 matrix M transforms the vectors x and y into
the vectors x' and y' (i.e., x' = Mx and y' = My), then the vector
z = λx + μy will be transformed into the vector z' = λx' + μy'.


8 Show that a glide reflection in the x-axis, in two dimensions, whereby
each point is reflected in the x-axis and then undergoes a translation
parallel to the x-axis, may be represented by a 3 x 3 matrix.

What does the equation

\[ \begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix} = \begin{pmatrix} 0 & 1 & a \\ 1 & 0 & a \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} x \\ y \\ 1 \end{pmatrix} \]

represent? Describe the effect of the matrices

\[ \begin{pmatrix} \pm 1 & 0 & a \\ 0 & \pm 1 & b \\ 0 & 0 & 1 \end{pmatrix} \]

in the four possible cases of selection of signs.


13.3 Products of linear transformations and matrices 


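Suppose that the general 3-dimensional vector x is transformed to give a
vector x', and that this in turn is transformed to give x''. Let us further
suppose that these transformations can be represented as

\[ \mathbf{x}'' = A\mathbf{x}' \quad \text{and} \quad \mathbf{x}' = B\mathbf{x}, \]

where

\[ A = \begin{pmatrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \\ \alpha_3 & \beta_3 & \gamma_3 \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}. \]

Hence

\[ \begin{aligned} x'' &= \alpha_1 x' + \beta_1 y' + \gamma_1 z' \\ y'' &= \alpha_2 x' + \beta_2 y' + \gamma_2 z' \\ z'' &= \alpha_3 x' + \beta_3 y' + \gamma_3 z' \end{aligned} \qquad \text{and} \qquad \begin{aligned} x' &= a_1 x + b_1 y + c_1 z \\ y' &= a_2 x + b_2 y + c_2 z \\ z' &= a_3 x + b_3 y + c_3 z. \end{aligned} \]

Substitution gives

\[ \begin{aligned} x'' &= \alpha_1 (a_1 x + b_1 y + c_1 z) + \beta_1 (a_2 x + b_2 y + c_2 z) + \gamma_1 (a_3 x + b_3 y + c_3 z) \\ &= (\alpha_1 a_1 + \beta_1 a_2 + \gamma_1 a_3)x + (\alpha_1 b_1 + \beta_1 b_2 + \gamma_1 b_3)y + (\alpha_1 c_1 + \beta_1 c_2 + \gamma_1 c_3)z. \end{aligned} \]

Similarly we can write down expressions for y'' and z'', and the three
results can be combined in matrix form:

\[ \begin{pmatrix} x'' \\ y'' \\ z'' \end{pmatrix} = \begin{pmatrix} \alpha_1 a_1 + \beta_1 a_2 + \gamma_1 a_3 & \alpha_1 b_1 + \beta_1 b_2 + \gamma_1 b_3 & \alpha_1 c_1 + \beta_1 c_2 + \gamma_1 c_3 \\ \alpha_2 a_1 + \beta_2 a_2 + \gamma_2 a_3 & \alpha_2 b_1 + \beta_2 b_2 + \gamma_2 b_3 & \alpha_2 c_1 + \beta_2 c_2 + \gamma_2 c_3 \\ \alpha_3 a_1 + \beta_3 a_2 + \gamma_3 a_3 & \alpha_3 b_1 + \beta_3 b_2 + \gamma_3 b_3 & \alpha_3 c_1 + \beta_3 c_2 + \gamma_3 c_3 \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix} \]

or x'' = Cx, where C represents the combined effect, or product, of the two
transformations.

But x'' = Ax' and x' = Bx can also be combined to give

\[ \mathbf{x}'' = A(B\mathbf{x}), \]

so we now call C the product of A and B and write

\[ AB = C \quad \text{and} \quad \mathbf{x}'' = (AB)\mathbf{x}: \]

\[ \begin{pmatrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \\ \alpha_3 & \beta_3 & \gamma_3 \end{pmatrix}\begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} = \begin{pmatrix} \alpha_1 a_1 + \beta_1 a_2 + \gamma_1 a_3 & \alpha_1 b_1 + \beta_1 b_2 + \gamma_1 b_3 & \alpha_1 c_1 + \beta_1 c_2 + \gamma_1 c_3 \\ \alpha_2 a_1 + \beta_2 a_2 + \gamma_2 a_3 & \alpha_2 b_1 + \beta_2 b_2 + \gamma_2 b_3 & \alpha_2 c_1 + \beta_2 c_2 + \gamma_2 c_3 \\ \alpha_3 a_1 + \beta_3 a_2 + \gamma_3 a_3 & \alpha_3 b_1 + \beta_3 b_2 + \gamma_3 b_3 & \alpha_3 c_1 + \beta_3 c_2 + \gamma_3 c_3 \end{pmatrix}. \]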


Hence we see that the products of 3 x 3 matrices are formed just like 
those of 2 x 2 matrices (see 0.7) by ‘row and column multiplication’. 
More generally, we define the product AB of two matrices A and B in 
just the same way, the element of AB in the ith row and jth column being 
called the inner product of the ith row of A with the jth column of B. This, of 
course, necessitates that these two have the same number of elements, i.e., 
that the number of columns of A is equal to the number of rows of B, such 
matrices being called conformable for multiplication. 


Hence if A is an m x n matrix
and B is an n x p matrix,
AB will be an m x p matrix.
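
The 'row and column' rule, together with the conformability condition, can be sketched as follows. The function name `mat_mul` and the sample matrices are illustrative assumptions.

```python
def mat_mul(A, B):
    """Product AB by 'row and column multiplication'.

    A is m x n and B is n x p (lists of rows); the result is m x p.
    A ValueError is raised if the matrices are not conformable.
    """
    n = len(B)
    if any(len(row) != n for row in A):
        raise ValueError("matrices are not conformable for multiplication")
    p = len(B[0])
    # Element (i, j) is the inner product of row i of A with column j of B.
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(p)]
            for i in range(len(A))]

A = [[1, 2, 3]]                    # 1 x 3, illustrative
B = [[4, 5], [6, 7], [8, 9]]       # 3 x 2, illustrative
print(mat_mul(A, B))               # a 1 x 2 matrix

try:
    mat_mul(B, A)                  # 3 x 2 times 1 x 3: not conformable
except ValueError as err:
    print("BA:", err)
```

Here AB exists while BA does not, since B has two columns but A has only one row.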


Example 


Find AB and BA when 
(ii) $A = \begin{pmatrix} 1 & 2 & 3 \end{pmatrix}$, $B = \begin{pmatrix} 4 & 5 \\ 6 & 7 \\ 8 & 9 \end{pmatrix}$.
3 —3 
0 | 14 — 
i) 48 - (Ὁ Ι te 4) -ἰ 14 ι 
535 ᾿ς: 
3 —3 
ἢ 1 9 0 6 12 
BANE. nap cap) Ξ Ὁ ete): 
5 —5 0 10 20 
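(ii) $AB = \begin{pmatrix} 1 & 2 & 3 \end{pmatrix}\begin{pmatrix} 4 & 5 \\ 6 & 7 \\ 8 & 9 \end{pmatrix} = \begin{pmatrix} 40 & 46 \end{pmatrix}$;

whilst $BA = \begin{pmatrix} 4 & 5 \\ 6 & 7 \\ 8 & 9 \end{pmatrix}\begin{pmatrix} 1 & 2 & 3 \end{pmatrix}$ does not exist.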


Exercise 13.3a 
1 | 


4 1 ] - 
1 a=(; ,) B=(; ) Cc={2 0 p=(~, 
0 4 
Find where possible AB, BA, CD, DC, DB, BD, DA, AD, CA, AC, BC, CB. 


2 Find the missing elements: 


Ι —3 —25 —-1 1 3 
l —l . 7. —] 
1 | a Tih. “ig )- i) 
] 0 
3 Show that $I = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$
(unit matrix) is an identity for post-multiplication of matrices with
3 columns, and also for pre-multiplication of matrices with 3 rows, while
MI = IM = M where M is any 3 x 3 matrix. Generalise.


6 —3 4 3 
If A= B= 


find AB and BA. Find other counter-examples to illustrate that
AB = 0 ⇏ A = 0 or B = 0, where 0 is the null matrix. Deduce a counter-example
to show that AC = BC ⇏ A = B. Obtain examples with 3 x 3
matrices, taking


oe ot. ὦ 
A= 1 —3 -Ἰ 
Ι΄ 7 35 
5 Represent the following five quadratic equations by a single matrix 
equation: 
2x? + 5x -- ὃ = 0; x? — 7x +11 =0; 3x7 = 4x + 1; 
5—x—x*? = 0; 4x? + 3x = 0. 


6 If A is a 2 x 2 matrix such that $A^2 = 0$, find the general form of A.
Repeat when $A^2 = I$.


7 Find all the matrices that can be obtained by forming products (e.g., 
ABA’) from 


— |] 
Bet EN pelt 
ι O ] 0 
; wo 0 0 ω 
Repeat with C= (5 a D= & ᾿ 


(Note that in this question $i^2 = -1$, $\omega^3 = 1$.)

8 Show that the only 2 x 2 matrices which commute with every 2 x 2
matrix are those of the form $\begin{pmatrix} k & 0 \\ 0 & k \end{pmatrix}$. Extend the result to 3 x 3 matrices.


9 If A= (_* a B= Ee s) 
find AB and BA. What does this result remind you of? 
10 Find the products: 
2-1 0 5 —3 2 
@) [3 1 -—2 9-6 4f; 
O 4 —3/ \12 -—8 5 


(1) ( Fa 3 hae (_, : 2./2 Pas, 


RPE eee 4 2 5 — 21 
ae) ( )( } ) 
—-4 341 - — 21 2 


) at 
=3 5) | 
If AX = XA where X is a 2 Χ 2 matrix, show that X may be expressed 


in the form (! 7 ) 
q p-q| 


τι If A= find 4“, A’, A*,.... 
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Find the general form of matrices which commute with a given matrix 


1 O 0 
2 If M={0 0O —1j], prove that M*"*' = M. 
0 -ἴ 0 


0; 
3 1 C= ( ἢ find C*, C’,... and show that σ᾽ = J. 


πὶ 4 


be eed a0 O\ fl gi ἢ 
14 Express | 1 1 0] intheform [ὁ c OF{O 1 ἀ!. 
4 8 “--Ἰ de f/\0 0 1 


15 If A, B, C are 2 x 2 matrices, prove that (A x B) x C = A x (B x C)
(associative law of multiplication). Extend the proof to apply to 3 x 3
matrices. How can the proof be generalised to apply to any three matrices
which are conformable for multiplication?


16 Interpret row by column multiplication of two vectors $\mathbf{r}_1$, $\mathbf{r}_2$ in 3-space,
expressed by the equation

\[ \begin{pmatrix} x_1 & y_1 & z_1 \end{pmatrix}\begin{pmatrix} x_2 \\ y_2 \\ z_2 \end{pmatrix} = x_1 x_2 + y_1 y_2 + z_1 z_2. \]

Show that this 'inner product' of the two vectors $\mathbf{r}_1$, $\mathbf{r}_2$ has the value
$r_1 r_2 \cos\theta$, where $r_1$, $r_2$ are the lengths of the two vectors, and θ is the angle
between them. How may the concept of 'angle' be extended into more
than three dimensions?


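17 Express

\[ \begin{pmatrix} x & y & z \end{pmatrix}\begin{pmatrix} a & h & g \\ h & b & f \\ g & f & c \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix} \]

as a single expression, and show how $x^2 + y^2 + 2gx + 2fy + c$ may be
similarly expressed.
Repeat with the expression $2x^2 + 4y^2 - 6xz + 3yz$.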


18 Investigate the effect of pre-multiplication of a column vector of
dimension n by an n x n matrix containing n 1s and the rest 0s, each row
and each column of which contains a single 1. Repeat for post-multiplication
of a row-vector (such matrices are known as 'permutation matrices').


19 Interpret the matrix equation 


0 1 23-1 Οἡ (/-1 5 0 -4 
=f) OVS oo Ora Veo μον 0 


as a geometrical transformation applied to a certain quadrilateral. 
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t / f 
Gin τἀ κω τας Gi δὲ HCN [hy ἄδεινδῃ 
. t , a 
20 Theequation | }} 9o---% JH] a 02 ΟΠ} Κι. Yo---Yn 
Ze nee Ge De. RIND Zoe ozs 
x 


may be interpreted as follows: the n points [ y, | (r = 1, 2,...,m) are 


Ζ, 


transformed by the matrix (a δ᾽ ΟἹ) into the 7 points [ y; }, so that 
2 


r 


we are thinking of each of the first and third matrices as collections of 
column vectors. Interpret in this way the equation: 


0 0 ἡ] 4 0 7-1 3 O-5 I 2 
1 0 OFF2 -3 2 0 πΙ|ΞΙΙ 4 07-1 
O 1 O/ \3 O-5 1 2 2B: “2, r= 


Successive transformations 


We may now consider the combined effect of certain successive three-
dimensional transformations (which would be difficult to draw or even to
visualise) by simply applying the multiplication rule for the corresponding
matrices. For example, the products

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}\begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ -1 & 0 & 0 \end{pmatrix} \tag{1} \]

and

\[ \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 1 \\ -1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix} \tag{2} \]

illustrate (1) that the quarter-turn about the z-axis, represented by the
matrix

\[ \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \]

followed by the reflection in the plane y = z, represented by

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}, \]

combine to give a transformation represented by

\[ \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ -1 & 0 & 0 \end{pmatrix}; \]

(2) that if these operations are commuted, a different transformation
results. (In both cases, the composition transformation is not a 'simple'
one, being known as a rotatory reflection or rotatory inversion (see Coxeter, H.,
Introduction to geometry, Wiley, 1961, p. 99).)
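
The two products can be checked mechanically. The following is a small sketch, with variable names chosen here for illustration; the right-hand factor of each product is the transformation applied first.

```python
def mat_mul3(A, B):
    """Product of two 3 x 3 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

quarter_turn_z = [[0, 1, 0], [-1, 0, 0], [0, 0, 1]]
reflect_y_eq_z = [[1, 0, 0], [0, 0, 1], [0, 1, 0]]

# Quarter-turn first, then reflection: the reflection matrix stands on the left.
turn_then_reflect = mat_mul3(reflect_y_eq_z, quarter_turn_z)
# Reflection first, then quarter-turn.
reflect_then_turn = mat_mul3(quarter_turn_z, reflect_y_eq_z)

print(turn_then_reflect)
print(reflect_then_turn)
print(turn_then_reflect == reflect_then_turn)   # the two orders differ
```

The comparison in the last line makes the failure of commutativity explicit.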

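On the other hand, the scalar matrix

\[ \begin{pmatrix} k & 0 & 0 \\ 0 & k & 0 \\ 0 & 0 & k \end{pmatrix} \]

does commute with other 3 x 3 matrices; so that, for example, when the
quarter-turn

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & -1 & 0 \end{pmatrix} \quad \text{is combined with the enlargement} \quad \begin{pmatrix} 3 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 3 \end{pmatrix}, \]

the result is the same when the order of operations is reversed, being represented
by

\[ \begin{pmatrix} 3 & 0 & 0 \\ 0 & 0 & 3 \\ 0 & -3 & 0 \end{pmatrix} \quad \text{in each case.} \]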


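Suppose next we consider the transformations represented by the matrix
product

\[ \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ -1 & 0 & 0 \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & -1 \\ -1 & 0 & 0 \end{pmatrix}. \]

Each matrix on the L.H.S. represents a quarter-turn, but it is not obvious
what the product represents. We consider its effect on the general vector
$\begin{pmatrix} x \\ y \\ z \end{pmatrix}$, namely

\[ \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & -1 \\ -1 & 0 & 0 \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} y \\ -z \\ -x \end{pmatrix}, \]

and ask under what circumstances such a vector would be unmoved by the
transformation. This would require x = y, y = −z, z = −x, i.e., any
vector of the form $\begin{pmatrix} k \\ k \\ -k \end{pmatrix}$, or $k\begin{pmatrix} 1 \\ 1 \\ -1 \end{pmatrix}$, would be invariant under this
transformation: another example of an eigenvector. We suspect, for
this reason, that it is a rotation. Surprisingly, the rotation is through 120°,
for if the matrix is applied again to $\begin{pmatrix} y \\ -z \\ -x \end{pmatrix}$ it becomes $\begin{pmatrix} -z \\ x \\ -y \end{pmatrix}$, and after
another application it finally becomes $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$. As an exercise, you should find
the effect of the same two quarter-turns when they are (a) reversed in sense
(one at a time), (b) commuted, (c) both. Finally, observe that, since

\[ \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & -1 \\ -1 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \end{pmatrix}\begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}, \]

the transformation may be analysed into a 120° turn about the line
x = y = z, represented by

\[ \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}, \]

followed by a half-turn about the x-axis represented by

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \end{pmatrix}. \]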

Exercise 13.3b 


0 -1 -1 0 20 1 1 
I If A = ; B = ; C = — 
( 7 ( 0 ᾿ : 2 Ω ( Ἷ 
evaluate the products AB, BA, AC, CA, etc., also C*, D?, ABC, CBA, 


ΒΑΡ, etc. and interpret geometrically in each case. 


2 Repeat the above question using the matrices 
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010 st ἢ τ 001 
P={0 01}: @={ 0-1 0]; R=l0 1 O}; 
1 0 0 0 01 1 0 0 
2 0 0 
S=|0 2 0] 
0 0 2 


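3 If

\[ R_\alpha = \begin{pmatrix} \cos\alpha & \sin\alpha \\ -\sin\alpha & \cos\alpha \end{pmatrix}, \qquad R_\beta = \begin{pmatrix} \cos\beta & \sin\beta \\ -\sin\beta & \cos\beta \end{pmatrix}, \]

find $R_\alpha R_\beta$ and $R_\beta R_\alpha$, and deduce certain trigonometrical formulae.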


4 Decompose the following as the products of two matrices representing 
‘simple’ two-dimensional transformations such as rotations, reflections, 
enlargements, etc. : 


᾿ς oe Oy ox f OSB) (1 «8. 
(ἢ ( : 3) (ii Ἁ " (ii) & a 
᾿ς Ὁ 1D 9 --ϑλ , {0 -2\. 
i) ᾿ A 0) ( ἡ (i) " 5) 


5 Repeat the above question with the 3 x 3 matrices: 


0: 22 -1000 0 2 0 
G) | 0- 0], (ὦὁ} 0 0 1); (aH) [2 0 0]; 
-Ὁὁ 0 0 0.30 00 2 
0610 
(iv) {1 0 Of. 
00 0 
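6 Show that if

\[ M = \begin{pmatrix} 6 & -4 & 3 \\ 29 & -20 & 13 \\ 33 & -23 & 14 \end{pmatrix}, \]

then

\[ M\begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix} \quad \text{and} \quad M^3 = I. \]

What do you suspect?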
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7 Find vectors (‘eigenvectors’) whose directions are unaltered by 


\[ N = \begin{pmatrix} 1 & -2 & 0 \\ 1 & -1 & 0 \\ -1 & 0 & 1 \end{pmatrix}. \]

Show that $N^4 = I$, but that N does not represent a quarter-turn.


8 Experiment with the composition of rotations of period 2 (half-turns), 
3 and 4 about various axes as was done in the last paragraph of the text, 
and also with reflections in various planes. 

(A matrix M is said to have 'period n' if $M^r = I$ when r = n, but
$M^r \neq I$ when r < n.)


9 Construct 2 x 2 and 3 x 3 matrices with period 2, 3, 4, 5, ....


10 If a 3 x 3 matrix has the property $M^2 = I$ and MX = X, show that
it may well not represent a half-turn about the vector X. What conditions
are needed in addition to the two given in order that it may represent a
half-turn?


11 M is a certain 3 x 3 matrix whose elements are 0, 1 or −1, and each
row and each column of M contains exactly one non-zero element. Prove
that $M^2, M^3, \ldots, M^n$ are all of the same form, and deduce that $M^n = I$
for some positive integer n ≤ 48. Interpret the action of M on a vector
(x, y, z) geometrically. (M.E.I.)


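12 If the matrix of a plane translation is

\[ T = \begin{pmatrix} 1 & 0 & a \\ 0 & 1 & b \\ 0 & 0 & 1 \end{pmatrix}, \quad \text{show that} \quad T^n = \begin{pmatrix} 1 & 0 & na \\ 0 & 1 & nb \\ 0 & 0 & 1 \end{pmatrix}. \]

Investigate, by using 3 x 3 matrices, the composition of translations with
other types of plane transformations.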


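13 Interpret the plane transformations

\[ \begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & k \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} x \\ y \\ 1 \end{pmatrix} \]

and expand the product of the three 3 x 3 matrices.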


13.4 Inverse transformations and matrices 


So far we have discussed the effect of a particular transformation on a given
vector: 'What becomes of it?' We now ask the reverse question: 'Under
such and such a transformation, from where does a given vector originate?'
Or, in matrix notation,
if x' = Mx, can we find a matrix N such that x = Nx'?
If so, it will clearly follow that

\[ \mathbf{x} = N\mathbf{x}' = NM\mathbf{x} \quad \text{and} \quad \mathbf{x}' = M\mathbf{x} = MN\mathbf{x}'. \]

So, if these are true for all vectors x, x', then MN = NM = I†
and our problem is to find N which is the inverse of M (and which we shall
write $M^{-1}$).
Two dimensional case 


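It will be recalled (from 0.8) that in the two-dimensional case the transformation

\[ \begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} a & b \\ c & d \end{pmatrix}\begin{pmatrix} x \\ y \end{pmatrix} \]

had such an inverse, provided that its determinant Δ = ad − bc was
non-zero.

If $M = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$, then $M^{-1} = \dfrac{1}{\Delta}\begin{pmatrix} d & -b \\ -c & a \end{pmatrix}$, so that

\[ M^{-1}M = \frac{1}{ad - bc}\begin{pmatrix} d & -b \\ -c & a \end{pmatrix}\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \frac{1}{ad - bc}\begin{pmatrix} ad - bc & 0 \\ 0 & ad - bc \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = I \]

(and similarly $MM^{-1} = I$).

If, however, Δ = 0, there was no such inverse, and the transformation
together with its matrix M were said to be singular.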

Three-dimensional case 


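Let us now consider the transformation

\[ \begin{aligned} x' &= a_1 x + b_1 y + c_1 z \\ y' &= a_2 x + b_2 y + c_2 z \\ z' &= a_3 x + b_3 y + c_3 z \end{aligned} \qquad \text{or} \qquad \begin{pmatrix} x' \\ y' \\ z' \end{pmatrix} = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix}. \]


† We shall use I to denote the identity (or unit) matrix of any order. So here, for instance,
$I = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$; but in the two-dimensional case which follows, $I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$.


Our problem is to find the vector $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ from which the vector $\begin{pmatrix} x' \\ y' \\ z' \end{pmatrix}$ originated.

We start by asking the simpler question of where the vector $\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}$ originated,
i.e., we try to find $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ such that

\[ \begin{aligned} a_1 x + b_1 y + c_1 z &= 1 \qquad (1) \\ a_2 x + b_2 y + c_2 z &= 0 \qquad (2) \\ a_3 x + b_3 y + c_3 z &= 0. \qquad (3) \end{aligned} \]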

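Looking at the last two of these, we see that they are requiring us to find
a vector $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ which is perpendicular to both $\begin{pmatrix} a_2 \\ b_2 \\ c_2 \end{pmatrix}$ and $\begin{pmatrix} a_3 \\ b_3 \\ c_3 \end{pmatrix}$.

Now we know (see 11.5) that their vector product is precisely such a vector,
and this we abbreviate by writing the vector product thus:

\[ \begin{pmatrix} b_2 c_3 - b_3 c_2 \\ c_2 a_3 - c_3 a_2 \\ a_2 b_3 - a_3 b_2 \end{pmatrix} = \begin{pmatrix} A_1 \\ B_1 \\ C_1 \end{pmatrix}. \]

Similarly, we write

\[ \begin{pmatrix} b_3 c_1 - b_1 c_3 \\ c_3 a_1 - c_1 a_3 \\ a_3 b_1 - a_1 b_3 \end{pmatrix} = \begin{pmatrix} A_2 \\ B_2 \\ C_2 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} b_1 c_2 - b_2 c_1 \\ c_1 a_2 - c_2 a_1 \\ a_1 b_2 - a_2 b_1 \end{pmatrix} = \begin{pmatrix} A_3 \\ B_3 \\ C_3 \end{pmatrix}. \]

We now consider the matrix M* formed from these capital letters, with the
A's forming the first row:

\[ M^* = \begin{pmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{pmatrix}. \]

This is called the adjoint matrix, and we now consider the product

\[ MM^* = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}\begin{pmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{pmatrix} = \begin{pmatrix} a_1 A_1 + b_1 B_1 + c_1 C_1 & a_1 A_2 + b_1 B_2 + c_1 C_2 & a_1 A_3 + b_1 B_3 + c_1 C_3 \\ a_2 A_1 + b_2 B_1 + c_2 C_1 & a_2 A_2 + b_2 B_2 + c_2 C_2 & a_2 A_3 + b_2 B_3 + c_2 C_3 \\ a_3 A_1 + b_3 B_1 + c_3 C_1 & a_3 A_2 + b_3 B_2 + c_3 C_2 & a_3 A_3 + b_3 B_3 + c_3 C_3 \end{pmatrix}. \]

Now $\begin{pmatrix} A_1 \\ B_1 \\ C_1 \end{pmatrix}$ is, by definition, perpendicular to $\begin{pmatrix} a_2 \\ b_2 \\ c_2 \end{pmatrix}$ and $\begin{pmatrix} a_3 \\ b_3 \\ c_3 \end{pmatrix}$.

So $a_2 A_1 + b_2 B_1 + c_2 C_1 = 0$ and $a_3 A_1 + b_3 B_1 + c_3 C_1 = 0$ (as may be
verified by direct substitution). Indeed, the only terms in MM* which
do not vanish are those on the leading diagonal:

\[ a_1 A_1 + b_1 B_1 + c_1 C_1, \quad a_2 A_2 + b_2 B_2 + c_2 C_2, \quad a_3 A_3 + b_3 B_3 + c_3 C_3. \]

But each of these is quickly seen to be equal to the expression

\[ \Delta = a_1 b_2 c_3 - a_1 b_3 c_2 + b_1 c_2 a_3 - b_1 c_3 a_2 + c_1 a_2 b_3 - c_1 a_3 b_2, \]

which we have already called the determinant of M.

So

\[ MM^* = \begin{pmatrix} \Delta & 0 & 0 \\ 0 & \Delta & 0 \\ 0 & 0 & \Delta \end{pmatrix} = \Delta I, \quad \text{where } \Delta = \det(M), \]

and it can similarly be proved that

\[ M^*M = \Delta I. \]

Now if Δ ≠ 0, it follows that

\[ M\left(\frac{M^*}{\Delta}\right) = \left(\frac{M^*}{\Delta}\right)M = I, \]

so that M*/Δ is an inverse of M for both pre- and post-multiplication and
we can write

\[ M^{-1} = \frac{M^*}{\Delta} \quad (\text{provided } \Delta \neq 0). \]

It is immediately clear that $M^{-1}$ is unique, for if we suppose that L is
another inverse for pre-multiplication (or left inverse) such that LM = I,
then

\[ M^{-1} = IM^{-1} = (LM)M^{-1} = L(MM^{-1})\dagger = LI = L \]

and L is identical with $M^{-1}$.
Similarly, $M^{-1}$ is the unique inverse for post-multiplication.
If, however, Δ = 0, then M has no such inverse and is said to be singular.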
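
The construction of the adjoint from vector products of the rows can be sketched as below. The function names and the sample matrix are hypothetical, chosen only to illustrate the method; exact fractions are used for the inverse.

```python
from fractions import Fraction

def cross(p, q):
    """Vector product p x q of two 3-vectors."""
    return [p[1] * q[2] - p[2] * q[1],
            p[2] * q[0] - p[0] * q[2],
            p[0] * q[1] - p[1] * q[0]]

def mat_mul3(A, B):
    """Product of two 3 x 3 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def adjoint(m):
    """Adjoint M*: its columns are row2 x row3, row3 x row1, row1 x row2 of M."""
    r1, r2, r3 = m
    cols = [cross(r2, r3), cross(r3, r1), cross(r1, r2)]
    return [[cols[j][i] for j in range(3)] for i in range(3)]

def inverse3(m):
    """Inverse M*/det(M), or None when the determinant is zero."""
    adj = adjoint(m)
    delta = sum(m[0][k] * adj[k][0] for k in range(3))   # a1*A1 + b1*B1 + c1*C1
    if delta == 0:
        return None
    return [[Fraction(x, delta) for x in row] for row in adj]

M = [[2, 0, 1], [1, 1, 0], [0, 3, 1]]   # hypothetical example
print(adjoint(M))
print(mat_mul3(M, adjoint(M)))           # a scalar matrix, det(M) times I
print(mat_mul3(M, inverse3(M)))          # the unit matrix
```

Computing the determinant from the first row of M and the first column of M* mirrors the leading-diagonal argument used in the text.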


Example 1 


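Find the adjoints and the inverses of the matrices:

\[ (i)\ A = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 2 \\ 0 & -1 & 0 \end{pmatrix}; \qquad (ii)\ B = \begin{pmatrix} 0 & 1 & 2 \\ 1 & 2 & 3 \\ 2 & 3 & 4 \end{pmatrix}. \]

(i) Since $A = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 2 \\ 0 & -1 & 0 \end{pmatrix}$, we see that $A^* = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 0 & -2 \\ 0 & 1 & 0 \end{pmatrix}$

\[ \Rightarrow \quad AA^* = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 2 \\ 0 & -1 & 0 \end{pmatrix}\begin{pmatrix} 2 & 0 & 0 \\ 0 & 0 & -2 \\ 0 & 1 & 0 \end{pmatrix} = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix} = 2I \]

\[ \Rightarrow \quad A^{-1} = \tfrac{1}{2}A^* = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & \tfrac{1}{2} & 0 \end{pmatrix}. \]

(ii) Since $B = \begin{pmatrix} 0 & 1 & 2 \\ 1 & 2 & 3 \\ 2 & 3 & 4 \end{pmatrix}$, we see that $B^* = \begin{pmatrix} -1 & 2 & -1 \\ 2 & -4 & 2 \\ -1 & 2 & -1 \end{pmatrix}$

\[ \Rightarrow \quad BB^* = \begin{pmatrix} 0 & 1 & 2 \\ 1 & 2 & 3 \\ 2 & 3 & 4 \end{pmatrix}\begin{pmatrix} -1 & 2 & -1 \\ 2 & -4 & 2 \\ -1 & 2 & -1 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} = 0. \]

Hence Δ = 0, so that B is singular and does not possess an inverse.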


Example 2 


If A and B are both non-singular matrices, show that:

\[ (i)\ (A^{-1})^{-1} = A; \qquad (ii)\ (AB)^{-1} = B^{-1}A^{-1}. \]

(i) The inverse of A is $A^{-1}$,
so $AA^{-1} = A^{-1}A = I$.
Hence the inverse of $A^{-1}$ is A
and $(A^{-1})^{-1} = A$.

(ii) $(AB)(B^{-1}A^{-1}) = A(BB^{-1})A^{-1} = AIA^{-1} = AA^{-1} = I$
and $(B^{-1}A^{-1})(AB) = B^{-1}(A^{-1}A)B = B^{-1}IB = B^{-1}B = I$.
So the inverse of AB is $B^{-1}A^{-1}$,
or $(AB)^{-1} = B^{-1}A^{-1}$.


† Using the associativity property of matrix multiplication (see Exercise 13.3a).

Example 3 
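Given the transformation

\[ \begin{aligned} x' &= x + 2y + 3z \\ y' &= 3x + y + 2z \\ z' &= 2x + 3y + z, \end{aligned} \]

find: (i) its inverse transformation;
(ii) the point from which (1, 2, 3) arises under the given
transformation.

The matrix of the original transformation is

\[ A = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \\ 2 & 3 & 1 \end{pmatrix}, \]

and we can easily find its adjoint matrix

\[ A^* = \begin{pmatrix} -5 & 7 & 1 \\ 1 & -5 & 7 \\ 7 & 1 & -5 \end{pmatrix}. \]

Furthermore, it is seen that

\[ AA^* = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \\ 2 & 3 & 1 \end{pmatrix}\begin{pmatrix} -5 & 7 & 1 \\ 1 & -5 & 7 \\ 7 & 1 & -5 \end{pmatrix} = \begin{pmatrix} 18 & 0 & 0 \\ 0 & 18 & 0 \\ 0 & 0 & 18 \end{pmatrix} = 18I. \]

So the determinant Δ of the transformation is 18, the transformation is
non-singular and

\[ A^{-1} = \frac{A^*}{\Delta} = \frac{1}{18}\begin{pmatrix} -5 & 7 & 1 \\ 1 & -5 & 7 \\ 7 & 1 & -5 \end{pmatrix}. \]

Hence the inverse transformation is

\[ \begin{aligned} x &= \tfrac{1}{18}(-5x' + 7y' + z') \\ y &= \tfrac{1}{18}(x' - 5y' + 7z') \\ z &= \tfrac{1}{18}(7x' + y' - 5z'). \end{aligned} \]

Finally, putting x' = 1, y' = 2, z' = 3 we see that the point (1, 2, 3)
arises from x = 2/3, y = 2/3, z = −1/3, i.e., from the point (2/3, 2/3, −1/3).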
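
The arithmetic of this example can be confirmed with exact fractions. The sketch below takes the matrix A and its adjoint as given in the example; the helper names are assumptions made for illustration.

```python
from fractions import Fraction

A = [[1, 2, 3], [3, 1, 2], [2, 3, 1]]
A_ADJ = [[-5, 7, 1], [1, -5, 7], [7, 1, -5]]   # adjoint, as found in the example
DELTA = 18                                      # determinant, as found in the example

def mat_vec3(m, v):
    """Apply a 3 x 3 matrix to a column vector."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]

def preimage(target):
    """Point which the transformation with matrix A sends to `target`."""
    return [Fraction(t, DELTA) for t in mat_vec3(A_ADJ, target)]

source = preimage([1, 2, 3])
print(source)                  # the point from which (1, 2, 3) arises
print(mat_vec3(A, source))     # transforming it recovers (1, 2, 3)
```

Feeding the answer back through the original transformation is a useful habit whenever an inverse has been computed by hand.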
Example 4 


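Solve the equations

\[ \begin{aligned} a_1 x + b_1 y + c_1 z &= d_1 \\ a_2 x + b_2 y + c_2 z &= d_2 \\ a_3 x + b_3 y + c_3 z &= d_3. \end{aligned} \]

These may be written as

\[ M\mathbf{x} = \mathbf{d}, \quad \text{where} \quad M = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} \quad \text{and} \quad \mathbf{d} = \begin{pmatrix} d_1 \\ d_2 \\ d_3 \end{pmatrix}. \]

Case (i) If M is non-singular, then it possesses an inverse $M^{-1}$ and

\[ \begin{aligned} M\mathbf{x} = \mathbf{d} \ &\Rightarrow\ M^{-1}(M\mathbf{x}) = M^{-1}\mathbf{d} \\ &\Rightarrow\ (M^{-1}M)\mathbf{x} = M^{-1}\mathbf{d} \\ &\Rightarrow\ I\mathbf{x} = M^{-1}\mathbf{d} \\ &\Rightarrow\ \mathbf{x} = M^{-1}\mathbf{d}. \end{aligned} \]

In a particular case, with numerical values for all the a's, b's and c's, we
could find the matrix $M^{-1}$ and hence solve the equations. But this is a
tedious process and, though we shall investigate it further (in 13.6), we
shall also (in 13.7) learn how more practical methods can be employed.

Case (ii) If M is singular, the above method breaks down as $M^{-1}$ does
not exist. But we shall see (in 13.6) that a useful result can still be obtained
simply by using the adjoint matrix M* rather than M*/Δ.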


Example 5 
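Given $l_1^2 + m_1^2 = l_2^2 + m_2^2 = 1$ and $l_1 l_2 + m_1 m_2 = 0$;
prove that $l_1^2 + l_2^2 = m_1^2 + m_2^2 = 1$ and $l_1 m_1 + l_2 m_2 = 0$.

First of all the reader should attempt to prove this by ordinary manipulation
of the equations. Even if successful, it will be agreed that the task is
vexing. But how much simpler it is to use matrices. For if we let

\[ A = \begin{pmatrix} l_1 & m_1 \\ l_2 & m_2 \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} l_1 & l_2 \\ m_1 & m_2 \end{pmatrix}, \]

we immediately see that

\[ AB = \begin{pmatrix} l_1 & m_1 \\ l_2 & m_2 \end{pmatrix}\begin{pmatrix} l_1 & l_2 \\ m_1 & m_2 \end{pmatrix} = \begin{pmatrix} l_1^2 + m_1^2 & l_1 l_2 + m_1 m_2 \\ l_1 l_2 + m_1 m_2 & l_2^2 + m_2^2 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. \]

Hence A and B are inverses, so that

\[ BA = \begin{pmatrix} l_1 & l_2 \\ m_1 & m_2 \end{pmatrix}\begin{pmatrix} l_1 & m_1 \\ l_2 & m_2 \end{pmatrix} = \begin{pmatrix} l_1^2 + l_2^2 & l_1 m_1 + l_2 m_2 \\ l_1 m_1 + l_2 m_2 & m_1^2 + m_2^2 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \]

\[ \Rightarrow \quad l_1^2 + l_2^2 = m_1^2 + m_2^2 = 1 \quad \text{and} \quad l_1 m_1 + l_2 m_2 = 0. \]

Should it be protested that the saving in work is not very great, consider
an extension of the given problem into three dimensions:

Given the equations

\[ \begin{aligned} l_1^2 + m_1^2 + n_1^2 &= 1 \quad (1) & l_2 l_3 + m_2 m_3 + n_2 n_3 &= 0 \quad (4) \\ l_2^2 + m_2^2 + n_2^2 &= 1 \quad (2) & l_3 l_1 + m_3 m_1 + n_3 n_1 &= 0 \quad (5) \\ l_3^2 + m_3^2 + n_3^2 &= 1 \quad (3) & l_1 l_2 + m_1 m_2 + n_1 n_2 &= 0 \quad (6) \end{aligned} \]

prove that

\[ l_1^2 + l_2^2 + l_3^2 = 1; \quad m_1^2 + m_2^2 + m_3^2 = 1; \quad n_1^2 + n_2^2 + n_3^2 = 1; \]

and

\[ m_1 n_1 + m_2 n_2 + m_3 n_3 = n_1 l_1 + n_2 l_2 + n_3 l_3 = l_1 m_1 + l_2 m_2 + l_3 m_3 = 0. \]

To attempt this by 'ordinary algebra' would be prohibitive. By matrices,
it is scarcely any more difficult than the 2 by 2 case above. For we may take

\[ A = \begin{pmatrix} l_1 & m_1 & n_1 \\ l_2 & m_2 & n_2 \\ l_3 & m_3 & n_3 \end{pmatrix}, \]

and B to be its 'transpose' (i.e., with rows and columns interchanged),
and the reader will have no difficulty in completing the proof on the lines
of the 2 x 2 case. Note that equations (1)-(6) are represented by the single
matrix equation AB = I.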
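
A numerical instance may make the three-dimensional statement more concrete. The matrix below is a hypothetical example whose rows satisfy equations (1) to (6); the sketch then checks that the corresponding relations hold for its columns.

```python
from fractions import Fraction

def transpose(m):
    """Interchange the rows and columns of a square matrix."""
    return [list(col) for col in zip(*m)]

def mat_mul(A, B):
    """Product of two square matrices of the same order."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
third = Fraction(1, 3)

# Hypothetical example: each row has unit length and the rows are
# mutually perpendicular, which is what equations (1)-(6) assert.
A = [[1 * third, 2 * third, 2 * third],
     [2 * third, 1 * third, -2 * third],
     [2 * third, -2 * third, 1 * third]]
B = transpose(A)

print(mat_mul(A, B) == IDENTITY)   # relations on the rows
print(mat_mul(B, A) == IDENTITY)   # hence the same relations on the columns
```

The second product is exactly the matrix whose entries are the nine expressions to be proved equal to 1 or 0.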


Exercise 13.4a 


1 Obtain the inverse of $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$:
(i) by letting it be $\begin{pmatrix} p & q \\ r & s \end{pmatrix}$;
(ii) by solving the equations
ax + by = x'
cx + dy = y'
to obtain x and y in terms of x' and y'.
Bea 


2 Find the inverse of ( 9 


᾿ and hence find the quadrilateral 


+1 
which is mapped by this matrix into the unit square ΄ " 


3 Show that the inverse of the product ABCD... is ...$D^{-1}C^{-1}B^{-1}A^{-1}$,
where A, B, C, D, ... are square matrices of the same order.


4 A 2 xX 2 matrix M has the property that 


δ αν 
b q d 5 
Prove that, provided a, 5, c, d satisfy a certain restriction, then M = VU 
where pee << and Va(e 7). 
b d 4 5 
State the restriction on a, b,c, d. (S.M.P.) 


5 A and B are matrices such that AB = A, BA = B. If A is non-singular,
prove that A = B = I. If A is singular, prove B is also singular, and that
$(A - B)^2 = 0$. (M.E.I.)
2—l1 4 
6 Find the adjoint and the inverse of [| 4 0 2] and hence solve 
the equations a2. 1} 
2x—y+4z=1 
ne 2 | 
3x — Ὧῃ + 72 = —3. 
Ι.:. 20 
7 Find the adjoint and inverse of M={—-—1 3. 1] and use to find the 
0 4 2 
=2 
column vector which, when transformed by M, becomes Ϊ 
] 


Ια ὦ 
8 Find the inverse of the triangular matrix | 0 1 ς 
0 0 


] 
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9 Find the inverse of | ¢ a ὁ | and state under what circumstances 


bc a 
it 15 singular. 
atb=3 1 1 0 
10 Solve 6+c = 4 by first inverting the matrix |g 1 |] 
a=) 
I Oto 


11 Under what conditions does M = M™!, where Mis: 


ay ὃ. Cy 


. (a ὃ : 
(7) [ a (1), a. by eg}? 


a3 ὃς C3 


Which types of geometrical transformations may such matrices represent? 


12 Invert the matrix $M = \begin{pmatrix} 1 & 1 & 0 \\ 5 & 1 & -3 \\ 2 & 7 & 4 \end{pmatrix}$ and also the transposed
matrix $M' = \begin{pmatrix} 1 & 5 & 2 \\ 1 & 1 & 7 \\ 0 & -3 & 4 \end{pmatrix}$.

Can you draw any general conclusion about the inverse of the transpose of
a 3 × 3 matrix? Deduce that the inverse of a symmetric matrix (i.e., one
for which M' = M) is itself symmetric.


13 A matrix which satisfies MM' = M'M = I, i.e., whose inverse is its
transpose, is called an orthogonal matrix. Investigate such matrices (i) in the
2 × 2 case, (ii) in the 3 × 3 case, and give examples of each.


14 Show that if M is an orthogonal 3 × 3 matrix, and MX = X', where
X, X' are column 3-vectors, then these vectors have the same length, i.e.,
if $X = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$ and $X' = \begin{pmatrix} x' \\ y' \\ z' \end{pmatrix}$, then $x^2 + y^2 + z^2 = x'^2 + y'^2 + z'^2$.
15 Find the inverse of
$$\begin{pmatrix} \cos\alpha\cos\beta & \sin\alpha\cos\beta & -\sin\beta \\ \cos\alpha\sin\beta & \sin\alpha\sin\beta & \cos\beta \\ \sin\alpha & -\cos\alpha & 0 \end{pmatrix}.$$

How do you account for the result geometrically?


16 If A is a matrix of period n (i.e., $A^r = I$ when r = n, but $A^r \neq I$ when
r < n), show that $BAB^{-1}$ is also of period n, where B is another square
matrix of the same order as A.

17 Further practice for finding inverses:


1 4 --ὁἍ 1, 2. Ὁ —15 ll 7 
@) |-2 -5 χη [0 1 2], (i)f{ 11 —8 —5], 
3 3 —10/ 00 1 -1 2 1] 
6 0 -|} 2 - 0 laa 
(2v) 3 —3 —2], (v) | -l 2 --͵ἰ|][ο[ᾳἅ7[. (}]  α 1 al, 
-Ξ 6 3 0 --Ἰ ] aa \i 
l1 a ὁ : ; 
᾿ ἐν zr | {am 1 
(v)fa 1 cf, in| ) (4 " 
-ΙΞ- 1 ὦ 
ne | 
2 
] Oo ω Pas sy 
(x) [ὠ 1 wm |, where : 
‘ ao” = |] 
Oo oa 1 


2 × 2 linear equations: three interpretations


At the beginning of this section, we saw how the inverse of a 2 × 2 matrix
helped in the solution of two simultaneous equations. Following this, we
considered briefly the solution of three simultaneous linear equations and
how the inverse of a 3 × 3 matrix might be found and used. The whole
question of the solution of linear equations will be dealt with in more detail
in 13.6 and 13.7, but this is a convenient point at which to make some
preliminary observations in the case of two equations.
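Consider the equations

$$\begin{aligned} a_1x + b_1y &= c_1 \\ a_2x + b_2y &= c_2 \end{aligned} \qquad (1)$$

If $M = \begin{pmatrix} a_1 & b_1 \\ a_2 & b_2 \end{pmatrix}$, $\mathbf{x} = \begin{pmatrix} x \\ y \end{pmatrix}$, $\mathbf{c} = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix}$,

these may be written as

$$\begin{pmatrix} a_1 & b_1 \\ a_2 & b_2 \end{pmatrix}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix} \quad\text{or}\quad M\mathbf{x} = \mathbf{c}. \qquad (2)$$

And if $\mathbf{a} = \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}$, $\mathbf{b} = \begin{pmatrix} b_1 \\ b_2 \end{pmatrix}$,

they can be written as

$$x\mathbf{a} + y\mathbf{b} = \mathbf{c}. \qquad (3)$$

Each of these alternatives carries a different interpretation. The two
separate equations (1) suggest two straight lines in the x, y plane, whose
intersection one is trying to find. Equation (2) suggests that one is looking
for an unknown vector $\mathbf{x} = \begin{pmatrix} x \\ y \end{pmatrix}$ which is transformed by the matrix
$M = \begin{pmatrix} a_1 & b_1 \\ a_2 & b_2 \end{pmatrix}$ into the known vector $\mathbf{c} = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix}$: where did this particular
vector originate under the given transformation? The final version (3)
starts with three given vectors,

$$\mathbf{a} = \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}, \quad \mathbf{b} = \begin{pmatrix} b_1 \\ b_2 \end{pmatrix}, \quad \mathbf{c} = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix},$$

and we ask what multiples of the first two must be combined to give the
third, or how can we ‘mix’ the a and b vectors to give the c vector?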
In general, one expects to get a unique solution to each of these questions. 


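Thus, in the case
$$\begin{aligned} 3x + 2y &= 1 \\ 4x - y &= -6 \end{aligned}$$
we obtain x = −1, y = 2. This means


(1) that the point common to the two lines with given equations has
coordinates (−1, 2);


(2) that under the transformation represented by the matrix $\begin{pmatrix} 3 & 2 \\ 4 & -1 \end{pmatrix}$
the vector $\begin{pmatrix} 1 \\ -6 \end{pmatrix}$ arises from the vector $\begin{pmatrix} -1 \\ 2 \end{pmatrix}$;


(3) that in order to obtain the vector $\begin{pmatrix} 1 \\ -6 \end{pmatrix}$ one should take
$-1\begin{pmatrix} 3 \\ 4 \end{pmatrix} + 2\begin{pmatrix} 2 \\ -1 \end{pmatrix}$. This is illustrated in the figure.

More generally, such a unique solution for x, y may always be found by the
use of the inverse matrix when this exists (i.e., when $\Delta = a_1b_2 - a_2b_1 \neq 0$).
For then

$$M\mathbf{x} = \mathbf{c} \;\Rightarrow\; \mathbf{x} = M^{-1}\mathbf{c}, \quad\text{i.e.}\quad \begin{pmatrix} x \\ y \end{pmatrix} = \frac{1}{a_1b_2 - a_2b_1}\begin{pmatrix} b_2 & -b_1 \\ -a_2 & a_1 \end{pmatrix}\begin{pmatrix} c_1 \\ c_2 \end{pmatrix}.$$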
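
The inverse-matrix solution can be tried out numerically. The following is a minimal sketch in Python, not part of the original text: the function name `solve_2x2` and the sample coefficients are illustrative assumptions, and exact fractions are used so that no rounding occurs.

```python
from fractions import Fraction

def solve_2x2(a1, b1, c1, a2, b2, c2):
    """Solve a1*x + b1*y = c1 and a2*x + b2*y = c2 using the inverse matrix."""
    delta = a1 * b2 - a2 * b1
    if delta == 0:
        raise ValueError("delta = 0: the inverse matrix does not exist")
    # (x, y) = (1/delta) * [[b2, -b1], [-a2, a1]] applied to (c1, c2)
    x = Fraction(b2 * c1 - b1 * c2, delta)
    y = Fraction(a1 * c2 - a2 * c1, delta)
    return x, y

# Hypothetical system: 3x + 2y = 1, 4x - y = -6
x, y = solve_2x2(3, 2, 1, 4, -1, -6)
print(x, y)
```

For this sample system the sketch returns x = −1, y = 2; when Δ = 0 it refuses to continue, which is exactly the exceptional case.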


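2 × 2 linear equations: singular case
The exceptional case gives two possibilities, illustrated by the examples:

$$\text{(i)}\;\; \begin{aligned} 3x - 2y &= 1 \\ 6x - 4y &= -1; \end{aligned} \qquad\qquad \text{(ii)}\;\; \begin{aligned} 3x - 2y &= 1 \\ 6x - 4y &= 2. \end{aligned}$$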


Equations (i) have no finite solutions in x, y. The first of our three
interpretations shows this by the fact that the lines are parallel, and so fail to
intersect. The second interpretation is that the transformation represented
by the matrix $\begin{pmatrix} 3 & -2 \\ 6 & -4 \end{pmatrix}$ is singular, and the effect on all vectors in the x, y
plane is to take them into vectors having the direction $\begin{pmatrix} 1 \\ 2 \end{pmatrix}$, for

$$\begin{pmatrix} 3 & -2 \\ 6 & -4 \end{pmatrix}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 3x - 2y \\ 6x - 4y \end{pmatrix} = (3x - 2y)\begin{pmatrix} 1 \\ 2 \end{pmatrix}.$$

No conceivable values of x, y can cause this vector to be $\begin{pmatrix} 1 \\ -1 \end{pmatrix}$, which
has a different direction. The third interpretation is that we cannot possibly
take a certain multiple of $\mathbf{a} = \begin{pmatrix} 3 \\ 6 \end{pmatrix}$ and a certain multiple of $\mathbf{b} = \begin{pmatrix} -2 \\ -4 \end{pmatrix}$
and combine them to obtain $\mathbf{c} = \begin{pmatrix} 1 \\ -1 \end{pmatrix}$, simply because a and b have
the same direction, i.e., lie in the same one-dimensional space (line),
whereas c is outside that space.

Equations (ii), on the other hand, have an infinity of solutions for x, y.
The lines, taking the first interpretation, are coincident, and so have an
infinity of points in common. (One of the equations, you may prefer to
think, is superfluous.) You should think out the second and third
interpretations in case (ii) as well.

Note that the exceptional cases in the solution of Mx = c arise precisely
in those cases when the equations

$$\begin{aligned} a_1x + b_1y &= 0 \\ a_2x + b_2y &= 0 \end{aligned} \quad\text{or}\quad M\mathbf{x} = \mathbf{0}$$

do possess solutions other than x = 0. For if Δ ≠ 0, the only possible
solution to the homogeneous equations

$$\begin{aligned} a_1x + b_1y &= 0 \\ a_2x + b_2y &= 0 \end{aligned}$$

is x = y = 0.

Exercise 13.4b 


1 Give the three interpretations suggested in the text in the case of the 
equations: 
(2) x+2y= 2 (ἢ) 5x —6y = 105 

3x -- y = 13; —3x + 4y = 


| 

| 
n 

ΝΜ] 


and draw diagrams to illustrate. 


2 Apply the three interpretations to the equations 


2x -- y=a a 4 a —] 
τὰν οἱ 5 ἢ a 7 4 3 


and give a parametric representation of the infinity of solutions in the 
latter case. 


Ξ ἢ ὃ 
3 What multiples of the vectors ( ᾿ and ( ὴ must be combined to 


46 
give the vector [ 9 


4 Prove that the equations $a_1x + b_1y = 0$, $a_2x + b_2y = 0$ have an
infinity of solutions if and only if $a_1b_2 - a_2b_1 = 0$. Prove however, that in
this case, the equations $a_1x + b_1y = c_1$ and $a_2x + b_2y = c_2$ do not in
general have a solution, and find under what conditions they do.


5 Show that the system of equations
$$\begin{aligned} 5x - 6y + 3 &= 0 \\ -3x + 4y - 2 &= 0 \end{aligned}$$
has a unique solution even though the matrix $\begin{pmatrix} 5 & -6 & 3 \\ -3 & 4 & -2 \end{pmatrix}$ has the final
two columns linearly dependent.


Codes 


Having established the idea of an inverse, we are now in a position to 
consider a possible application of matrices to the coding and decoding of 
messages. One obvious way of constructing a code is to use a number to 
stand for each letter of the alphabet, thus:

        A  B  C  D  E  F  G  H  I  J  K  L  M
Row 1   1  2  3  4  5  6  7  8  9 10 11 12 13
Row 2   3  6  9 12 15 18 21 24  1  4  7 10 13

        N  O  P  Q  R  S  T  U  V  W  X  Y  Z
Row 1  14 15 16 17 18 19 20 21 22 23 24 25 26
Row 2  16 19 22 25  2  5  8 11 14 17 20 23 26


Row 1 provides too transparent a method. Row 2 is an improvement. It
might have been a random permutation of the numbers 1, 2, 3, ..., 25, 26,
but it will be seen that there is a ‘system’, whereby each number in row 1
has been multiplied by 3 and reduced modulo 26 where necessary, e.g.,
21 × 3 = 63 ≡ 11 (mod 26). The idea of this system is that it makes it
easy to decode a message without having to keep referring to the crib. For,
as coding is performed by multiplication of row 1 by 3, so decoding is carried
out by division by 3, e.g., 11 ÷ 3 ≡ 63 ÷ 3 (mod 26) = 21, giving U.
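
The ‘multiply by 3, reduce modulo 26’ system can be sketched in a few lines of Python. This is an illustration rather than anything from the text: the function names are hypothetical, and ‘division by 3’ is carried out by multiplying by 9, the inverse of 3 modulo 26 (since 3 × 9 = 27 ≡ 1). The call `pow(key, -1, modulus)` needs Python 3.8 or later.

```python
import string

def encode_letter(ch, key=3, modulus=26):
    """Row 1 number of the letter, multiplied by the key and reduced mod 26."""
    n = string.ascii_uppercase.index(ch) + 1      # A = 1, ..., Z = 26
    return (n * key) % modulus or modulus          # write a remainder of 0 as 26

def decode_number(m, key=3, modulus=26):
    """Undo encode_letter by 'dividing' by the key modulo 26."""
    key_inverse = pow(key, -1, modulus)            # 9 when key = 3
    n = (m * key_inverse) % modulus or modulus
    return string.ascii_uppercase[n - 1]

print(encode_letter("U"))    # 21 x 3 = 63, which reduces to 11
print(decode_number(11))     # U
```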


We may write:
3 × (WORD) → CODED WORD
$3^{-1}$ × (CODED WORD) → WORD


This method has the obvious disadvantage that it may be easily broken
down on account of each letter of the alphabet being invariably represented
by the same number: it is a simple ‘replacement’ code. It also suffers
from the inconvenience of having to reduce modulo 26, a rather awkward
number. The latter objection may be removed by adding four characters
as shown:

        .   ,   ?   □
       27  28  29  30

Then it becomes necessary to discard multiples of 30, which is easier. To
remove the former objection, we abandon row 2,† and re-arrange the
message, say: ‘COME QUICKLY TO MY ASSISTANCE’ as follows:

C M □ U C L □ O M □ S I T N E □
O E Q I K Y T □ Y A S S A C . □

(□ means ‘space’). Coding by row 1,† we set up a two-row matrix:

$$W = \left(\begin{array}{rrrrrrrrrrrrrrrr} 3 & 13 & 0 & 21 & 3 & 12 & 0 & 15 & 13 & 0 & 19 & 9 & 20 & 14 & 5 & 0 \\ 15 & 5 & 17 & 9 & 11 & 25 & 20 & 0 & 25 & 1 & 19 & 19 & 1 & 3 & 27 & 0 \end{array}\right)$$


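which is then to be ‘coded’ by a coding matrix, $C = \begin{pmatrix} 3 & 1 \\ 5 & 2 \end{pmatrix}$, pre-multiplying as follows:

$$CW = \begin{pmatrix} 3 & 1 \\ 5 & 2 \end{pmatrix}\left(\begin{array}{rrrrrrrrrrrrrrrr} 3 & 13 & 0 & 21 & 3 & 12 & 0 & 15 & 13 & 0 & 19 & 9 & 20 & 14 & 5 & 0 \\ 15 & 5 & 17 & 9 & 11 & 25 & 20 & 0 & 25 & 1 & 19 & 19 & 1 & 3 & 27 & 0 \end{array}\right)$$

$$= \left(\begin{array}{rrrrrrrrrrrrrrrr} 24 & 44 & 17 & 72 & 20 & 61 & 20 & 45 & 64 & 1 & 76 & 46 & 61 & 45 & 42 & 0 \\ 45 & 75 & 34 & 123 & 37 & 110 & 40 & 75 & 115 & 2 & 133 & 83 & 102 & 76 & 79 & 0 \end{array}\right)$$

$$\equiv \left(\begin{array}{rrrrrrrrrrrrrrrr} 24 & 14 & 17 & 12 & 20 & 1 & 20 & 15 & 4 & 1 & 16 & 16 & 1 & 15 & 12 & 0 \\ 15 & 15 & 4 & 3 & 7 & 20 & 10 & 15 & 25 & 2 & 13 & 23 & 12 & 16 & 19 & 0 \end{array}\right) \pmod{30},$$

giving the coded message:

XONOQDLCTGATTJOODYABPMPWALOPLS.

† A random permutation of the set 0, 1, 2, ..., 29 would be marginally better.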


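In order to decode, we need the decoder matrix, D, which will take us from
CW to W: D(CW) = W. Thus DC = I, so that D (as expected) should
be the inverse of C, i.e., $\begin{pmatrix} 3 & 1 \\ 5 & 2 \end{pmatrix}^{-1} = \begin{pmatrix} 2 & -1 \\ -5 & 3 \end{pmatrix}$.

Going in the reverse direction then, we have

$$\begin{pmatrix} 2 & -1 \\ -5 & 3 \end{pmatrix}\left(\begin{array}{rrrrrrrrrrrrrrrr} 24 & 14 & 17 & 12 & 20 & 1 & 20 & 15 & 4 & 1 & 16 & 16 & 1 & 15 & 12 & 0 \\ 15 & 15 & 4 & 3 & 7 & 20 & 10 & 15 & 25 & 2 & 13 & 23 & 12 & 16 & 19 & 0 \end{array}\right)$$

$$= \left(\begin{array}{rrrrrrrrrrrrrrrr} 33 & 13 & 30 & 21 & 33 & -18 & 30 & 15 & -17 & 0 & 19 & 9 & -10 & 14 & 5 & 0 \\ -75 & -25 & -73 & -51 & -79 & 55 & -70 & -30 & 55 & 1 & -41 & -11 & 31 & -27 & -3 & 0 \end{array}\right)$$

$$\equiv \left(\begin{array}{rrrrrrrrrrrrrrrr} 3 & 13 & 0 & 21 & 3 & 12 & 0 & 15 & 13 & 0 & 19 & 9 & 20 & 14 & 5 & 0 \\ 15 & 5 & 17 & 9 & 11 & 25 & 20 & 0 & 25 & 1 & 19 & 19 & 1 & 3 & 27 & 0 \end{array}\right) \pmod{30},$$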


and the decoded message can then be read off. Notice that both the word 
spacing and the repetition of letters are completely obscured by this 
method: the three Ss in ‘assistance’ come out as P, M, and W respectively. 
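
The coding and decoding just described can be checked by machine. The following Python sketch is an illustration, not the authors' program: the names `ALPHABET` and `transform` are hypothetical, the message is taken two letters at a time (each pair being one column of W), and the space is numbered 0, which is 30 reduced modulo 30.

```python
# Index in this string = code number: space = 0, A = 1, ..., Z = 26, then . , ?
ALPHABET = " ABCDEFGHIJKLMNOPQRSTUVWXYZ.,?"

def transform(matrix, text):
    """Apply a 2x2 integer matrix, modulo 30, to successive pairs of characters."""
    if len(text) % 2:
        text += " "                      # pad so the message splits into pairs
    (p, q), (r, s) = matrix
    out = []
    for i in range(0, len(text), 2):
        top = ALPHABET.index(text[i])
        bottom = ALPHABET.index(text[i + 1])
        out.append(ALPHABET[(p * top + q * bottom) % 30])
        out.append(ALPHABET[(r * top + s * bottom) % 30])
    return "".join(out)

C = ((3, 1), (5, 2))      # coder, determinant 3*2 - 1*5 = 1
D = ((2, -1), (-5, 3))    # decoder, the inverse of C

coded = transform(C, "COME QUICKLY TO MY ASSISTANCE.")
print(coded)
print(transform(D, coded))
```

Because Python's `%` always returns a non-negative remainder, the negative entries that arise in decoding are reduced correctly without special treatment.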

Of course, the coder C may be chosen in many ways, but in order that
the decoder D may contain integers only, we must arrange for the
determinant of C (= ad − bc) to be ±1. In the above case, |C| = 3 × 2 − 1 × 5 = 1.


For example, 


ao 0 | 
A = he -- Co = : 
if C ( ) [6] Ι, and 2 ( - 


The presence of a zero makes computation easy: 


Do Ne: 15. OPO, ase ll τ 7. ΟἿ᾽ nas 
Ι 0]}λΛ1Ι5ὐ 511 9 ....) 3 13 0 21] ... 




and the fact that the second row of C, (1, 0) simply moves the top row of W 
to the bottom scarcely gives the game away; for the zero moving to the 
bottom row causes the first ‘word’ of the coded message to have 5 letters 
instead of 4. 

Evidently the method could be made more subtle by using a 3 × 3
coder, the message being written out in three rows. The computation might
be heavy, but use of a computer would avoid the tedium. One of the
difficulties would be to find a 3 × 3 matrix with integral coefficients whose
inverse has the same property. However, you might try with the following
coder:

$$C = \begin{pmatrix} 2 & 1 & 3 \\ 1 & 0 & -1 \\ 4 & 2 & 5 \end{pmatrix}, \text{ the corresponding decoder being } D = \begin{pmatrix} 2 & 1 & -1 \\ -9 & -2 & 5 \\ 2 & 0 & -1 \end{pmatrix}$$

(first checking for yourself that CD = I).
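
A computer can also do the checking. The sketch below is a hedged illustration with hypothetical function names: it inverts a 3 × 3 integer matrix whose determinant is +1 or −1 by forming the transposed matrix of cofactors, so that the decoder is guaranteed to contain integers only. The sample matrix is an assumed integer coder with determinant 1.

```python
def det3(m):
    """Determinant of a 3x3 matrix, expanded by the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def integer_inverse(m):
    """Inverse of a 3x3 integer matrix whose determinant is +1 or -1."""
    d = det3(m)
    if d not in (1, -1):
        raise ValueError("determinant must be +1 or -1 for an integer inverse")

    def cofactor(r, c):
        rows = [x for x in range(3) if x != r]
        cols = [x for x in range(3) if x != c]
        minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                 - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
        return (-1) ** (r + c) * minor

    # Inverse = (transposed cofactor matrix) / d; with d = +1 or -1,
    # dividing by d is the same as multiplying by d.
    return [[d * cofactor(c, r) for c in range(3)] for r in range(3)]

coder = [[2, 1, 3], [1, 0, -1], [4, 2, 5]]   # assumed example, determinant 1
print(integer_inverse(coder))
```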


Exercise 13.4c 


1 Using row (1) at the beginning of this section (A = 1, B = 2, etc.,
with . , ? □ represented by 27, 28, 29, 30), and the encoder


l ] 
C= ( ) and decoder D = : Ε in the text, 


construct and decode various messages. (This should be done in pairs, each 
pupil constructing a message and exchanging with his partner.) 


2 Find other suitable encoders and decoders, avoiding fractions by
arranging for ad − bc = ±1.


3 Code the message, ‘Come quickly to my assistance’, using

$$\begin{pmatrix} 2 & 1 & 3 \\ 1 & 0 & -1 \\ 4 & 2 & 5 \end{pmatrix} \text{ as encoder, and } \begin{pmatrix} 2 & 1 & -1 \\ -9 & -2 & 5 \\ 2 & 0 & -1 \end{pmatrix} \text{ as decoder,}$$

with the modulo 30 system established in the text. Repeat with the message
‘The quick brown dog jumps over the lazy fox.’


4 Using row (2) on p. 421 (A = 3, B = 6, ..., Z = 26, . = 27, , = 28,
? = 29, □ = 30), the word ADAM was coded as GDNE. What was the
encoding matrix? Using this matrix, how would the message ‘BEEF
ESSENCE’ be coded?


13.5 Determinants 


We have already met instances of both 2 × 2 and 3 × 3 determinants.
Unlike a matrix, a determinant is not just a table of numerical data, but
(quite differently) a shorthand way of writing an algebraic expression,
whose value can be computed when the values of the ‘terms’ or ‘elements’
are known.
There is little to be said about 2 × 2 determinants:

$$\begin{vmatrix} a & b \\ c & d \end{vmatrix} = ad - bc,$$


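and this method of writing such a two-termed expression does not carry
many advantages. However, there is some point in doing a little work on
them in the following exercise since the rules for manipulating 2 × 2
determinants are imitated by those for larger determinants. For example,

if $\Delta = \begin{vmatrix} a & b \\ c & d \end{vmatrix}$, then $\begin{vmatrix} ak & b \\ ck & d \end{vmatrix} = k\Delta$, and $\begin{vmatrix} ak & bk \\ ck & dk \end{vmatrix} = k^2\Delta$

(not kΔ, as with matrices).

So with 3 × 3 determinants,

if $\Delta = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$, then $\begin{vmatrix} a_1k & b_1k & c_1k \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} = k\Delta$;

while if every element were multiplied by k, the value would be $k^3\Delta$.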
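
These scaling rules are easily confirmed on a numerical example. The following Python sketch is illustrative only; the matrix and the multiplier k are arbitrary choices, and `det3` is a hypothetical helper that expands a 3 × 3 determinant by its first row.

```python
def det3(m):
    """Determinant of a 3x3 matrix, expanded by the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

m = [[2, 4, 18], [-3, 10, -27], [1, 3, 11]]   # arbitrary example
k = 5

one_row_scaled = [[k * x for x in m[0]], m[1], m[2]]
all_scaled = [[k * x for x in row] for row in m]

print(det3(one_row_scaled) == k * det3(m))       # one row multiplied by k
print(det3(all_scaled) == k ** 3 * det3(m))      # every element multiplied by k
```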


Exercise 13.5a 


1 Evaluate: 


3 —4 .. (8 -—3 oa 8 5 
is 173 163 ἰὴ .42 6 (vi) 13. 1 
v2 
ἜΝ χὰ 104)? nyse 48 1001 77 
᾿ 300 -- 400 
(vit ) 
|—-240 320 
2 Show that 2 42 has a factor (a — ὁ). Generalise the result. 
a 
3 Simplify: 
atba-—b a cos 9 sin 8 
(1) ; (11) : 
a—b a+b —sin@ cos@ 


cos @ sin @ cosh x = sinh u 


(222 ) ; (20) 


sinh u cosh u 


ΧΕ Ζ IN ar SZ 
; (wt) 
y+ qz ry + SZ 


sin@ cos @ 


οἱ ae 
τ π- Δ: Pt 
4 If $\Delta = \begin{vmatrix} a & b \\ c & d \end{vmatrix}$, show that $\begin{vmatrix} a+c & b+d \\ c & d \end{vmatrix}$ and $\begin{vmatrix} a & b+ka \\ c & d+kc \end{vmatrix}$
both have the value Δ, and write down some other determinants which
have the same value.


5 Express $\begin{vmatrix} a+x & b+y \\ c+p & d+q \end{vmatrix}$ as the sum of determinants.


6 Find the values of λ so that the equations

$$\begin{aligned} 2x + 5y &= \lambda x \\ 3x + 4y &= \lambda y \end{aligned}$$

may have a non-trivial solution.

2—-x “-| 
=—6. Sex 


7 Solve the equation for x: | 


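8 Show that, in general, the solutions of the simultaneous equations

$$\begin{aligned} a_1x + b_1y + c_1 &= 0 \\ a_2x + b_2y + c_2 &= 0 \end{aligned}$$

may be written

$$\frac{x}{\begin{vmatrix} b_1 & c_1 \\ b_2 & c_2 \end{vmatrix}} = \frac{y}{\begin{vmatrix} c_1 & a_1 \\ c_2 & a_2 \end{vmatrix}} = \frac{1}{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}}.$$

Use this:
(i) to solve 3x − y = −8, x + 7y = 1;
(ii) to eliminate x from the equations $ax^2 + bx + c = 0$, $px^2 + qx + r = 0$.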


3 × 3 determinants


We now consider the 3 straight lines with equations:

$$\begin{aligned} a_1x + b_1y + c_1 &= 0 \qquad (1) \\ a_2x + b_2y + c_2 &= 0 \qquad (2) \\ a_3x + b_3y + c_3 &= 0 \qquad (3) \end{aligned}$$

and the condition for them to be concurrent. Solving (2) and (3) gives (in the
case of non-zero denominators):

$$\frac{x}{b_2c_3 - b_3c_2} = \frac{y}{c_2a_3 - c_3a_2} = \frac{1}{a_2b_3 - a_3b_2},$$

the point of intersection of lines (2) and (3). If this point is to lie also on the
first line, its coordinates must satisfy (1), and so by substitution we obtain

$$a_1(b_2c_3 - b_3c_2) + b_1(c_2a_3 - c_3a_2) + c_1(a_2b_3 - a_3b_2) = 0. \qquad (4)$$

The expression on the left-hand side is precisely what we have already
called the determinant of the coefficients in the three equations, written

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}.$$


Such expressions occur so frequently in mathematics that the determinant
shorthand notation was devised for them. Just as expressions like
x × x × x × x × x have the abbreviation $x^5$, and with this abbreviated
notation comes manipulation using the laws of indices, so with determinants
we shall want to devise corresponding methods of manipulation.

Another instance of the occurrence of 3 × 3 determinants in coordinate
geometry is in the area of a triangle formed by 3 points $(x_1, y_1)$, $(x_2, y_2)$,
$(x_3, y_3)$ which is given by the expression

$$\Delta = \tfrac{1}{2}(x_1y_2 - x_2y_1 + x_2y_3 - x_3y_2 + x_3y_1 - x_1y_3) = \tfrac{1}{2}\begin{vmatrix} x_1 & y_1 & 1 \\ x_2 & y_2 & 1 \\ x_3 & y_3 & 1 \end{vmatrix}.$$

The condition for collinearity of 3 points is of course that this determinant
should vanish.
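
The area formula and the collinearity test translate directly into a few lines of Python. This is a sketch for illustration (the function names are hypothetical); it assumes integer coordinates so that the half can be kept as an exact fraction, and it returns a signed area whose sign depends on the order in which the vertices are taken.

```python
from fractions import Fraction

def triangle_area(p1, p2, p3):
    """Signed area: half the determinant with rows (x, y, 1)."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    d = x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)
    return Fraction(d, 2)

def collinear(p1, p2, p3):
    """Three points are collinear exactly when the determinant vanishes."""
    return triangle_area(p1, p2, p3) == 0

print(triangle_area((0, 0), (4, 0), (0, 3)))   # a 3-4-5 right-angled triangle
print(collinear((0, 0), (1, 1), (2, 2)))
```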

In this section we shall deal principally with 3 × 3 determinants† and
then indicate briefly how their behaviour is imitated in the case of
determinants of higher order.


Expansion of determinants: minors and cofactors
The expansion of the general 3 × 3 determinant has already been given:

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} = a_1b_2c_3 - a_1b_3c_2 + b_1c_2a_3 - b_1c_3a_2 + c_1a_2b_3 - c_1a_3b_2.$$

Each term is the product of 3 elements of the determinant, no two in the
same row or column (i.e., there are no terms like $a_1b_1c_2$ or $a_3b_3c_3$), so it is
easily seen that the number of terms must be 3 × 2 × 1 = 6. In the
case of 4 × 4 determinants, the number of terms (such as $a_1b_2c_4d_3$) would
be 4 × 3 × 2 × 1 = 4! = 24, and in the general case of an n × n
determinant, there will be n! terms in the expansion. Thus the notation gains in
brevity as the order of Δ increases: a 5 × 5 determinant is a very efficient
abbreviation for an algebraic expression with 120 terms.

In dealing with determinants of higher order, one needs to know the
rule for deciding which of the n! terms carry a + and which a − sign.
This rule is bound up with the theory of odd and even permutations, and
we do not need to concern ourselves with it here.

† Determinants must be square: there is no such thing as a rectangular determinant.
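
For the curious reader, the sign rule can nevertheless be seen at work in a short program. The Python sketch below is an illustration with hypothetical function names: it forms one term for every way of choosing one element from each row with no two in the same column, and attaches a + or − sign according to whether the corresponding permutation is even or odd (judged here by counting inversions).

```python
from itertools import permutations

def sign(perm):
    """+1 for an even permutation, -1 for an odd one (by counting inversions)."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm)) if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def det_by_permutations(m):
    """Sum of the n! signed products, one element from each row and column."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

print(len(list(permutations(range(5)))))                           # 120 terms for a 5x5
print(det_by_permutations([[2, 4, 18], [-3, 10, -27], [1, 3, 11]]))
```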

The value of the 3 × 3 determinant may be written in many alternative
ways, each of which we abbreviate using the notation of 13.4:

$$\begin{aligned}
a_1(b_2c_3 - b_3c_2) + b_1(c_2a_3 - c_3a_2) + c_1(a_2b_3 - a_3b_2) &= a_1A_1 + b_1B_1 + c_1C_1 \qquad (1) \\
a_2(b_3c_1 - b_1c_3) + b_2(c_3a_1 - c_1a_3) + c_2(a_3b_1 - a_1b_3) &= a_2A_2 + b_2B_2 + c_2C_2 \qquad (2) \\
a_3(b_1c_2 - b_2c_1) + b_3(c_1a_2 - c_2a_1) + c_3(a_1b_2 - a_2b_1) &= a_3A_3 + b_3B_3 + c_3C_3 \qquad (3) \\
a_1(b_2c_3 - b_3c_2) + a_2(b_3c_1 - b_1c_3) + a_3(b_1c_2 - b_2c_1) &= a_1A_1 + a_2A_2 + a_3A_3 \qquad (4) \\
b_1(c_2a_3 - c_3a_2) + b_2(c_3a_1 - c_1a_3) + b_3(c_1a_2 - c_2a_1) &= b_1B_1 + b_2B_2 + b_3B_3 \qquad (5) \\
c_1(a_2b_3 - a_3b_2) + c_2(a_3b_1 - a_1b_3) + c_3(a_1b_2 - a_2b_1) &= c_1C_1 + c_2C_2 + c_3C_3 \qquad (6)
\end{aligned}$$

(1) is called the ‘expansion of the determinant by the first row’: the
elements $a_1, b_1, c_1$ of the first row are multiplied by their corresponding
cofactors $A_1, B_1, C_1$ to form the inner product $a_1A_1 + b_1B_1 + c_1C_1$. (2) and
(3) are expansions by the second and third rows respectively, while in
(4), (5), (6) we have written the expansion by the first, second and third
columns.


We now need to recognise at sight the cofactors of the several terms of the
determinant

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \qquad\qquad \begin{matrix} + & - & + \\ - & + & - \\ + & - & + \end{matrix}$$

It will be seen from the examples given above that each cofactor is either
plus or minus the 2 × 2 determinant obtained by striking out the row and
column through the element concerned, the appropriate sign being given
by the chessboard scheme on the right. For example, if we want the
cofactor of $c_2$, we strike out the second row and third column, i.e., those which
intersect in the element $c_2$ as in the third part of the diagram, thereby
obtaining the determinant $\begin{vmatrix} a_1 & b_1 \\ a_3 & b_3 \end{vmatrix}$, which is known as the minor of $c_2$.
The scheme of signs shows that $C_2$ has a value which is the negative of this
minor, i.e.,

$$C_2 = -\begin{vmatrix} a_1 & b_1 \\ a_3 & b_3 \end{vmatrix} = -(a_1b_3 - a_3b_1) = a_3b_1 - a_1b_3.$$

† See Budden, F. J., The Fascination of Groups, C.U.P., 1972, p. 312; Moakes, A. J. and Neill, H.,
Vectors, Matrices and Linear Equations, Oliver and Boyd, 1967.

The rule is true for determinants of any order (and enables the general
determinant to be defined recursively):


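$$\text{Signs:}\quad \begin{matrix} + & - & + & - & + \\ - & + & - & + & - \\ + & - & + & - & + \\ - & + & - & + & - \\ + & - & + & - & + \end{matrix}$$

The minor of $d_5$ = $\begin{vmatrix} a_1 & b_1 & c_1 & e_1 \\ a_2 & b_2 & c_2 & e_2 \\ a_3 & b_3 & c_3 & e_3 \\ a_4 & b_4 & c_4 & e_4 \end{vmatrix}$, so that $D_5 = -\begin{vmatrix} a_1 & b_1 & c_1 & e_1 \\ a_2 & b_2 & c_2 & e_2 \\ a_3 & b_3 & c_3 & e_3 \\ a_4 & b_4 & c_4 & e_4 \end{vmatrix}$.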


In theoretical work we prefer to work with cofactors rather than minors, 
since the expansions as in (1)...(6) do not contain any — signs, the co- 
factors themselves incorporating the appropriate signs. The latter are 
therefore sometimes referred to as ‘rectified’ minors. 
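
The recursive definition mentioned above can be written out as a short program. The Python sketch below is one possible rendering, offered as an illustration: the function name `det` is an assumption, a matrix is held as a list of rows, and the expansion is always taken along the first row with the chessboard signs +, −, +, ....

```python
def det(m):
    """Determinant of a square matrix (a list of rows) by first-row cofactors."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0
    for j in range(n):
        # Minor of the element in row 0, column j: strike out that row and column.
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        # Chessboard sign for position (row 0, column j) is (-1)**j.
        total += (-1) ** j * m[0][j] * det(minor)
    return total

print(det([[1, 2], [3, 4]]))
print(det([[2, 4, 18], [-3, 10, -27], [1, 3, 11]]))
```

Each call reduces an n × n determinant to n determinants of order n − 1, so the work grows like n!; this is acceptable for small orders but not for the large systems met in practice.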


Exercise 13.5b 


1 Evaluate the following determinants and check your answers by ex- 
panding them in several different ways: 


Ὁ: 59. 95. eS | -l 3 1 
Gila “O° 7215 ΠΣ soe Δ} (az) |—6 12 3); 
2-l 4 3 3 4 6 1 0 
Pces2 =o. 
60 18 49 
ΞΕ; io ea 
(v)}50 O —14; (v) 
9 steed: “Ὁ 
20 —6 —28 
2 Expand the determinants


abe ac ὦ abe 
CG: ὁ. boa cl, b ¢ al, 
b ¢ a ¢c «ἃ c α ὁ 


and deduce the values of 


ab 0 1 x? x 
b Ὁ al and {x Ι x? 
Oa b x7 χ 1 


3 Evaluate the cofactors of some of the elements in the above
determinants.


4 Write down the cofactors in $\begin{vmatrix} a & h & g \\ h & b & f \\ g & f & c \end{vmatrix}$ beginning $A = bc - f^2$.
Show that hA + bH + fG = 0, and obtain similar results.


5 Solve the equation for x: 


2 -- χ —3 5 


0 oats 2 |= 0. 
8 =—6. 8% 
2-3 1 
6 If A=|1 4 6), show that |A| = 0, 
0-1 --ὶ 
2 --ἀά --2 
and that [46]. A| LS) aa 
S| od 1} 


What do you notice? (Compare exercise 13.5d, no. 8). 


7 Expand the determinants: 
8 Eliminate x, y, and z from:


(4) x + ay — bz=0 (1) aly —z) =x 
—ax +y +z = 0; δ(Ζ -- χ) ΞΞ γ. 
bx —cy+z=0 C(x —y) =z 


9 What is the volume of a parallelepiped, three adjacent edges of which
are the vectors $OP_r$, where $P_r$ is $(x_r, y_r, z_r)$, r = 1, 2, 3? Find the volume
of a tetrahedron having vertices O (0, 0, 0), $P_1$ (−2, 1, 0), $P_2$ (3, 1, 1),
$P_3$ (1, 0, −4).


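10 Show that the result of eliminating x from the equations

$$\begin{aligned} ax^2 + bx + c &= 0 \\ px^2 + qx + r &= 0 \end{aligned} \quad\text{can be expressed}\quad \begin{vmatrix} a & b & c & 0 \\ 0 & a & b & c \\ p & q & r & 0 \\ 0 & p & q & r \end{vmatrix} = 0.$$

(Compare exercise 13.5a, no. 8).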


11 Eliminate x from the equations
$ax^2 + bx + c = 0$ and $px^3 + qx^2 + rx + s = 0$,
giving the result as a 5 × 5 determinant equated to zero.


Row and column operations 


Before dealing with the rules for the manipulation of determinants, we 
shall introduce a notation which will be found convenient throughout the 
remainder of this chapter for both matrices and determinants. 

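We may regard a matrix as a collection of row vectors which we shall
call $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3$, etc., and similarly the columns will be denoted $\mathbf{c}_1, \mathbf{c}_2, \mathbf{c}_3, \ldots$.

So, in the determinant $\begin{vmatrix} 0 & 3 & -1 \\ 2 & 5 & 1 \\ -1 & 0 & 2 \end{vmatrix}$, $\mathbf{c}_3$ denotes the vector $\begin{pmatrix} -1 \\ 1 \\ 2 \end{pmatrix}$.

Furthermore, we shall use a notation similar to that of transformations
to indicate operations of replacing individual rows or columns. For
example, $\mathbf{r}_2' = \mathbf{r}_2 - 2\mathbf{r}_1$ simply means ‘create a new second row by
subtracting from its elements twice the corresponding elements of the first
row’; so that the above determinant becomes

$$\begin{vmatrix} 0 & 3 & -1 \\ 2 - 2 \times 0 & 5 - 2 \times 3 & 1 - 2 \times (-1) \\ -1 & 0 & 2 \end{vmatrix} \quad\text{or}\quad \begin{vmatrix} 0 & 3 & -1 \\ 2 & -1 & 3 \\ -1 & 0 & 2 \end{vmatrix}.$$

Similarly, the operation $\mathbf{c}_2' = 2\mathbf{c}_2$ has the effect of creating a new second
column by doubling the elements of its predecessor.
It may be noticed that row operations on matrices can themselves be
performed by the use of matrices. For example, $\mathbf{r}_2' = \mathbf{r}_2 - 2\mathbf{r}_1$ can be
achieved by pre-multiplying the given matrix by

$$\begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$

Similarly, the operation $\mathbf{r}_1' = \alpha\mathbf{r}_2 + \beta\mathbf{r}_3$ is achieved by pre-multiplying
by

$$\begin{pmatrix} 0 & \alpha & \beta \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},$$

for

$$\begin{pmatrix} 0 & \alpha & \beta \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} = \begin{pmatrix} \alpha a_2 + \beta a_3 & \alpha b_2 + \beta b_3 & \alpha c_2 + \beta c_3 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix},$$

whilst the effect of pre-multiplication by

$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & \alpha \end{pmatrix}$$

would be to carry out the operation $\mathbf{r}_3' = \alpha\mathbf{r}_3$. Matrices which perform
these row operations are known as elementary matrices and we shall use them
extensively in the solution of systems of linear equations (13.7).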

Lastly, the reader might like to investigate how column operations can 
be achieved by means of post-multiplication by similar elementary matrices. 
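
The action of an elementary matrix is easily demonstrated by machine. In the Python sketch below (an illustration; `matmul` is a hypothetical helper and the values α = 2, β = 3 are arbitrary), pre-multiplication carries out the row operations described above, and post-multiplication is included so that the effect on columns may be compared.

```python
def matmul(p, q):
    """Row-by-column product of two matrices held as lists of rows."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

M = [[0, 3, -1], [2, 5, 1], [-1, 0, 2]]

E = [[1, 0, 0], [-2, 1, 0], [0, 0, 1]]     # r2' = r2 - 2*r1
print(matmul(E, M))                         # second row becomes (2, -1, 3)

F = [[0, 2, 3], [0, 1, 0], [0, 0, 1]]      # r1' = 2*r2 + 3*r3
print(matmul(F, M))                         # first row becomes (1, 10, 8)

print(matmul(M, E))                         # post-multiplication acts on columns
```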


Rules for manipulation of determinants 


We shall not go into the tedious process of proving these rules, but will 
simply illustrate them, letting them speak for themselves. 

(i) |M| = |M'|, i.e., the determinant of a matrix is unchanged when the matrix is
transposed,

e.g., $\Delta' = \begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix} = \Delta$ (the reader should verify that the expansion of
Δ' agrees term by term with that of Δ).


(ii) Multiple of row (or column),

e.g., $\Delta' = \begin{vmatrix} ka_1 & b_1 & c_1 \\ ka_2 & b_2 & c_2 \\ ka_3 & b_3 & c_3 \end{vmatrix} = k\Delta$; $\quad \Delta' = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ \lambda a_3 & \lambda b_3 & \lambda c_3 \end{vmatrix} = \lambda\Delta$, etc.

(iii) When two rows (or columns) are interchanged, the determinant changes sign,

e.g., $\Delta' = \begin{vmatrix} c_1 & b_1 & a_1 \\ c_2 & b_2 & a_2 \\ c_3 & b_3 & a_3 \end{vmatrix} = -\Delta$: this may be verified by actual expansion.

Corollaries: 


(a) If two rows (or columns) are identical, the determinant has the value zero (for in
this case: Δ = −Δ ⟹ Δ = 0).


(b) If two rows (or columns) are proportional, then the determinant is zero.


The two proportional rows (or columns) are, in fact, linearly dependent
(see 13.1) and the determinant is also zero if its three rows (or
columns) are linearly dependent, but this cannot usually be spotted at a
glance.


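(iv) A determinant is unchanged when to the elements of any row (or column) are
added a constant multiple of the elements of any other row (or column).


For example,

if $\Delta = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$,

let $\mathbf{r}_1' = \mathbf{r}_1 + \lambda\mathbf{r}_2$.

$$\begin{aligned}
\text{Then}\quad \Delta' &= \begin{vmatrix} a_1 + \lambda a_2 & b_1 + \lambda b_2 & c_1 + \lambda c_2 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \\
&= \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} + \lambda\begin{vmatrix} a_2 & b_2 & c_2 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \\
&= \Delta + \lambda \times 0 \\
&= \Delta.
\end{aligned}$$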


Finally, it is also worth expressing corollary (iii)(a) in terms of cofactors.


Suppose, for instance, that the second row of Δ is replaced by its first row
(i.e., $\mathbf{r}_2' = \mathbf{r}_1$).


Then $\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} = \Delta$ is replaced by $\begin{vmatrix} a_1 & b_1 & c_1 \\ a_1 & b_1 & c_1 \\ a_3 & b_3 & c_3 \end{vmatrix} = 0$ (since two rows are identical).


Now the cofactors of the second row have been unaffected by the
replacement, so the equation

$$a_2A_2 + b_2B_2 + c_2C_2 = \Delta$$

has been replaced by

$$a_1A_2 + b_1B_2 + c_1C_2 = 0.$$

This expression consists of the elements of row 1 of the original determinant
taken with the cofactors of a different row (row 2), or alien cofactors as they
are usually called.


Similarly $a_1A_3 + b_1B_3 + c_1C_3 = 0$;
and more generally we may say, calling any such expression an inner
product, that


The inner product of the elements of a row (or column) with the
cofactors of the same row (or column) gives the value of the determinant;


but the inner product of the elements of a row (or column) with the
alien cofactors of a different row (or column) gives zero.
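
Both statements can be seen together in one small table. In the Python sketch below (illustrative only; the function names and the sample determinant are assumptions), entry (i, j) of the table is the inner product of the elements of row i with the cofactors of row j, so that the determinant should appear down the leading diagonal and zero everywhere else.

```python
def cofactor(m, r, c):
    """Signed 2x2 minor of the element in row r, column c of a 3x3 matrix."""
    rows = [i for i in range(3) if i != r]
    cols = [j for j in range(3) if j != c]
    minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
             - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
    return (-1) ** (r + c) * minor

def inner_product(m, element_row, cofactor_row):
    """Elements of one row taken with the cofactors of another (or the same) row."""
    return sum(m[element_row][c] * cofactor(m, cofactor_row, c) for c in range(3))

m = [[2, 4, 18], [-3, 10, -27], [1, 3, 11]]
table = [[inner_product(m, i, j) for j in range(3)] for i in range(3)]
for row in table:
    print(row)
```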


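Example 1

Evaluate $\Delta = \begin{vmatrix} 2 & 4 & 18 \\ -3 & 10 & -27 \\ 1 & 3 & 11 \end{vmatrix}$.


We try to use row and column operations to obtain as many zeros as
possible:

$$\begin{aligned}
\Delta &= 2\begin{vmatrix} 1 & 2 & 9 \\ -3 & 10 & -27 \\ 1 & 3 & 11 \end{vmatrix} \\
&= 2\begin{vmatrix} 1 & 2 & 9 \\ -3 & 10 & -27 \\ 0 & 1 & 2 \end{vmatrix} \qquad (\mathbf{r}_3' = \mathbf{r}_3 - \mathbf{r}_1) \\
&= 2\begin{vmatrix} 1 & 2 & 5 \\ -3 & 10 & -47 \\ 0 & 1 & 0 \end{vmatrix} \qquad (\mathbf{c}_3' = \mathbf{c}_3 - 2\mathbf{c}_2) \\
&= -2\begin{vmatrix} 1 & 5 \\ -3 & -47 \end{vmatrix} \\
&= -2(-47 + 15) \\
&= 64.
\end{aligned}$$

Alternatively,

$$\Delta = 2\begin{vmatrix} 1 & 2 & 9 \\ -3 & 10 & -27 \\ 1 & 3 & 11 \end{vmatrix} = 2\begin{vmatrix} 1 & 2 & 9 \\ 0 & 16 & 0 \\ 1 & 3 & 11 \end{vmatrix} \qquad (\mathbf{r}_2' = \mathbf{r}_2 + 3\mathbf{r}_1)$$

$$= 32\begin{vmatrix} 1 & 9 \\ 1 & 11 \end{vmatrix} = 32 \times 2 = 64.$$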
Example 2

Factorise $\Delta = \begin{vmatrix} 1 & a & a^3 \\ 1 & b & b^3 \\ 1 & c & c^3 \end{vmatrix}$.


Putting b = c gives two identical rows, thus causing the determinant to
vanish. Therefore, by the remainder theorem, (b − c) is a factor. Similarly
(c − a) and (a − b) are also factors. Hence Δ = (b − c)(c − a)(a − b)F,
and consideration of the degree of the determinant enables us to see that
the remaining factor F must have degree 1. Clearly it is symmetric in
a, b and c, so must have the form k(a + b + c), where k is a constant. The
‘leading term’ $bc^3$ (the product of the elements in the leading diagonal)
has coefficient 1, and this shows us that k = 1. Hence


Δ = (b − c)(c − a)(a − b)(a + b + c).


Example 3

Evaluate $\Delta = \begin{vmatrix} a & b & c & d \\ d & a & b & c \\ c & d & a & b \\ b & c & d & a \end{vmatrix}$.

$$\text{Now}\quad \Delta = \begin{vmatrix} a+d+c+b & b+a+d+c & c+b+a+d & d+c+b+a \\ d & a & b & c \\ c & d & a & b \\ b & c & d & a \end{vmatrix} \qquad (\mathbf{r}_1' = \mathbf{r}_1 + \mathbf{r}_2 + \mathbf{r}_3 + \mathbf{r}_4)$$

$$= (a + b + c + d)\begin{vmatrix} 1 & 1 & 1 & 1 \\ d & a & b & c \\ c & d & a & b \\ b & c & d & a \end{vmatrix}.$$

It is, in fact, unnecessary to go so far: all we need is to obtain the factor
(a + b + c + d). Similarly, $\mathbf{r}_1' = \mathbf{r}_1 - \mathbf{r}_2 + \mathbf{r}_3 - \mathbf{r}_4$ detects the factor
(a − d + c − b), and so Δ = (a + b + c + d)(a − b + c − d)F. The
remaining factor F, being homogeneous of degree 2, may contain terms in
$a^2, b^2, c^2, d^2$, ab, ac, ad, bc, bd, cd. But it is easier to find it by using complex
numbers. For $\mathbf{r}_1' = \mathbf{r}_1 + i\mathbf{r}_2 - \mathbf{r}_3 - i\mathbf{r}_4$ replaces $\mathbf{r}_1$ by

$$(a + id - c - ib \quad b + ia - d - ic \quad c + ib - a - id \quad d + ic - b - ia)$$
$$= \big(a - ib - c + id \quad i(a - ib - c + id) \quad -(a - ib - c + id) \quad -i(a - ib - c + id)\big),$$

showing that (a − ib − c + id) is a factor. Similarly, the conjugate
(a + ib − c − id) is also a factor, and we get

$$\Delta = (a + b + c + d)(a - b + c - d)(a + ib - c - id)(a - ib - c + id).$$

Finally, combining the conjugate factors gives

$$\Delta = (a + b + c + d)(a - b + c - d)(a^2 + b^2 + c^2 + d^2 - 2ac - 2bd).$$
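
A factorisation of this kind is worth testing on numerical values before it is trusted. The following Python sketch is an illustration with hypothetical function names: it evaluates the 4 × 4 determinant directly by first-row cofactors and compares it with the factorised form for a few arbitrary integer values of a, b, c, d. Agreement on samples is a check, of course, not a proof.

```python
def det(m):
    """Determinant by expansion along the first row (any order)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def circulant(a, b, c, d):
    """The 4x4 determinant of Example 3, as a list of rows."""
    return [[a, b, c, d], [d, a, b, c], [c, d, a, b], [b, c, d, a]]

def factorised(a, b, c, d):
    return ((a + b + c + d) * (a - b + c - d)
            * (a * a + b * b + c * c + d * d - 2 * a * c - 2 * b * d))

for values in [(1, 2, 3, 4), (2, -1, 0, 5), (0, 1, 0, 0)]:
    print(values, det(circulant(*values)), factorised(*values))
```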


Exercise 13.5c 


1 Perform the operations specified on the given determinants, and hence 
evaluate: 


4-18 3 
9 19 4 (ee | + 7. 
() = r;, = 2.4 + 2ro; 
—4 0 5 
Ι΄ a ὁ , 
(i) be Δ ὁ“ = a ἢ . 
ΤΙ aE ye ): 
Ι α΄ 0 
a—b—e 2a 2a 
Co = Cy — 
Gii)| 26 b=~c—a 2% ἄν 
Yo = 1. + 733 
2¢ 2¢ c—a-—b 
αὐ ε 
(wv) |b c¢ a οἱ = οι + Me, + w*e3. where w? = 1. 
2 Use the cube roots of unity to factorise $\begin{vmatrix} a & b & c \\ c & a & b \\ b & c & a \end{vmatrix}$

($\mathbf{r}_1' = \mathbf{r}_1 + \mathbf{r}_2 + \mathbf{r}_3$; compare the text above, and exercise 10.4b, no. 9).


3 Use the result of no. 2 above to write down the factors of 


a --ἸΙ 0 i @ Δ" 
0 a-—l1}| andof ja* 1 = a 
— | 0 a a a> | 


What is the value of the latter determinant when a is a complex cube root 
of unity? 


4 Factorise the following determinants: 


1 be a ie a στὰ α-πὖ 
os. i eae be (a) | be ca αὖ \\\e 

Ι abc a b C 

b+c cta at+bdb α ὧς «Ge l «α« ῥτε 
(111) ἢ a b |; (iv) |b τα 67; (vy) |l b e+ al. 

l ] l ὁ ab οἰ l ¢ at+b 
5 Show that:

$$\begin{vmatrix} a & b & c \\ a^2 & b^2 & c^2 \\ b+c & c+a & a+b \end{vmatrix} = k(a + b + c)(b - c)(c - a)(a - b),$$

and find the value of the constant k.


6 Evaluate $\begin{vmatrix} b & a & a \\ a & b & a \\ a & a & b \end{vmatrix}$,

and find other possible values of a 3 × 3 determinant which contains six
a's and three b's.


7 Express in factorised form the determinants:


Ι΄ aaa 
pF tag? ab ac 
ie Nie ῤαᾳ α 
(2) ab ct + a? δὲ |; (12 ) 
l1 a  α 
ca ch a’ + δ" 
l1 aa ὁ 
od 1 1 abea abe 
tl π|᾿ 1 11 jo a ἀ ε Db a 
ὡ 1-1 -- 1 | Ὁ c da bl ") a b d cl 
1-1 -1 --Ἰ dc ba bade 
8 Evaluate all the cofactors in the determinant $\begin{vmatrix} 1 & 2 & 4 \\ 2 & 4 & 8 \\ 4 & 8 & 16 \end{vmatrix}$.

Write down several other examples of determinants which have all the
2 × 2 minors equal to zero.
Compose a 4 × 4 determinant which has every one of its 3 × 3 minors zero.


Multiplication rule for determinants


Suppose that A and B are two 3 × 3 matrices and that their determinants
are denoted by |A| and |B|. Now we have seen (in 13.3) how to form, by
row and column multiplication, their product matrix AB, which in turn
will have a determinant denoted by |AB|. Is it, we wonder, possible to
state the value of |AB| simply from our knowledge of |A| and |B| without
having to find the product AB and then go through the tedious process of
calculating its determinant?

A little reflection will show that this is easy. For if the successive
transformations

$$\mathbf{x}'' = A\mathbf{x}', \qquad \mathbf{x}' = B\mathbf{x}$$

are combined to give

$$\mathbf{x}'' = AB\mathbf{x},$$

volumes are multiplied by the factors |A| and |B| under the separate
transformations, and by |AB| under the combined transformation AB (see 13.2).
But the final effects in both cases must be identical. Hence

$$|AB| = |A| \times |B|.$$

Similarly |BA| = |B| × |A|, so AB and BA have the same determinant,
even though the matrices themselves may well be different.

A similar proof, using areas, applies in the case of 2 × 2 determinants
and the result is also generally true for determinants of any order.
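
The multiplication rule can be confirmed on a numerical example. In the Python sketch below (illustrative only; the helper names and the two sample matrices are arbitrary assumptions), the products AB and BA are formed by row-and-column multiplication and their determinants compared with |A| × |B|.

```python
def det3(m):
    """Determinant of a 3x3 matrix, expanded by the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def matmul(p, q):
    """Row-by-column product of two 3x3 matrices."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

A = [[2, 4, 18], [-3, 10, -27], [1, 3, 11]]
B = [[0, 3, -1], [2, 5, 1], [-1, 0, 2]]

print(det3(A), det3(B))
print(det3(matmul(A, B)), det3(matmul(B, A)))   # both equal |A| x |B|
print(matmul(A, B) == matmul(B, A))             # the products themselves differ
```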


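Exercise 13.5d
1 (i) Verify, by expansion of the determinants, that

$$\begin{vmatrix} \alpha & \beta \\ \gamma & \delta \end{vmatrix}\begin{vmatrix} a & b \\ c & d \end{vmatrix} = \begin{vmatrix} \alpha a + \beta c & \alpha b + \beta d \\ \gamma a + \delta c & \gamma b + \delta d \end{vmatrix}.$$

(ii) Express the square of $D = \begin{vmatrix} a & b \\ c & d \end{vmatrix}$ in four different ways as a 2 × 2
determinant.

(Note that if $D' = \begin{vmatrix} a & c \\ b & d \end{vmatrix}$, then we may have DD', D'D, DD and
D'D'.)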


2 Find a counter-example to show that, if X, Y, Z are square matrices,
such that Z = X + Y, then in general det X + det Y ≠ det Z.


αι 0 
3 If D=/b 0 cl, express D* as a 3 x 3 determinant in two 
0 a ὁ 


different ways. 
4 Express $\begin{vmatrix} a^2 + p^2 & ab + pq & ac + pr \\ ab + pq & b^2 + q^2 & bc + qr \\ ac + pr & bc + qr & c^2 + r^2 \end{vmatrix}$
as the square of a determinant, and hence prove that it is zero.
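5 Using the notation $|\mathbf{a}\;\mathbf{b}\;\mathbf{c}|$ to denote $\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$,
where a, b, c are column vectors, prove that
$|\mathbf{a}+\mathbf{x}\;\;\mathbf{b}\;\;\mathbf{c}| = |\mathbf{a}\;\mathbf{b}\;\mathbf{c}| + |\mathbf{x}\;\mathbf{b}\;\mathbf{c}|$,
and that $|\mathbf{a}+\mathbf{x}+\mathbf{y}\;\;\mathbf{b}\;\;\mathbf{c}| = |\mathbf{a}\;\mathbf{b}\;\mathbf{c}| + |\mathbf{x}\;\mathbf{b}\;\mathbf{c}| + |\mathbf{y}\;\mathbf{b}\;\mathbf{c}|$.
Also express $|\mathbf{a}+\mathbf{x}\;\;\mathbf{b}+\mathbf{y}\;\;\mathbf{c}+\mathbf{z}|$ as the sum of eight determinants.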


6 Prove the row-by-column rule for multiplication of 3 x 3 determinants 
by expressing the determinant of the product matrix in 13.3 (see p. 399) as 
the sum of 27 determinants, 21 of which may be proved to be zero. 


7 If $\Delta = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$ and $\Delta^* = \begin{vmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{vmatrix}$ (see 13.4, p. 410),

show that $\Delta^* = \Delta^2$, and that $\begin{vmatrix} B_2 & B_3 \\ C_2 & C_3 \end{vmatrix} = a_1\Delta$,
with similar results for the other minors of Δ*.


8 If a 3 × 3 determinant is zero, prove that any pair of rows or of columns
of the adjoint determinant are linearly dependent,

e.g. that $\dfrac{A_1}{B_1} = \dfrac{A_2}{B_2} = \dfrac{A_3}{B_3}$, assuming $B_1, B_2, B_3 \neq 0$

(use the result of the previous question).

9 Show how to construct 2 × 2 and 3 × 3 matrices with elements which
are integers so that their inverses may also have elements which are
integers (cf. ‘codes’, pp. 420-423).


*13.6 Systems of linear equations 


Use of determinants in the solution of linear equations 


For practical purposes, particularly in the case of large numbers of equa- 
tions with large numbers of unknowns (as are frequently encountered in 
commercial, industrial and scientific situations), the method of reducing 
to échelon form (see 13.7) is the most convenient. But it is also instructive 
to approach the problem of the solution of a set of linear equations by using
determinants. 

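We shall merely hint at the case of a 2 × 2 system:

$$\begin{aligned} a_1x + b_1y &= c_1 \\ a_2x + b_2y &= c_2 \end{aligned}$$

by noting that, if $\Delta = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}$, the solutions may be written

$$\Delta x = \begin{vmatrix} c_1 & b_1 \\ c_2 & b_2 \end{vmatrix}, \qquad \Delta y = \begin{vmatrix} a_1 & c_1 \\ a_2 & c_2 \end{vmatrix},$$

these being true in the non-regular case ($\Delta = 0$) as well as in the regular
case (cf. pp. 417-419).

Proceeding immediately to the 3 × 3 case,

$$\begin{aligned} a_1x + b_1y + c_1z &= d_1 \qquad (1) \\ a_2x + b_2y + c_2z &= d_2 \qquad (2) \\ a_3x + b_3y + c_3z &= d_3, \qquad (3) \end{aligned}$$
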

a possible elementary way of obtaining solutions would be to eliminate z 
from (1) and (2) to obtain a new equation (4) in x and y; then to eliminate 
z from (2) and (3) to obtain a second equation (5) in x and y; and finally 
to eliminate y from (4) and (5) thereby enabling x to be found. How much 
simpler if this multi-step solution could be telescoped into a single step! 
In fact, this can be done by multiplying (1) by $A_1$, (2) by $A_2$, (3) by $A_3$,
and adding, leading to

$$(a_1A_1 + a_2A_2 + a_3A_3)x + (b_1A_1 + b_2A_2 + b_3A_3)y + (c_1A_1 + c_2A_2 + c_3A_3)z = d_1A_1 + d_2A_2 + d_3A_3.$$

In view of the properties of cofactors (see 13.5), we know

$$a_1A_1 + a_2A_2 + a_3A_3 = \Delta,$$

while the coefficients of $y$ and $z$ are zero because the elements of columns 2
and 3 have been combined with 'alien' cofactors, those of column 1. Thus
$y$ and $z$ have been eliminated in a single tour de force! The expression on the
R.H.S. has the value

$$d_1A_1 + d_2A_2 + d_3A_3 = \begin{vmatrix} d_1 & b_1 & c_1 \\ d_2 & b_2 & c_2 \\ d_3 & b_3 & c_3 \end{vmatrix} = \det(\mathbf{d}, \mathbf{b}, \mathbf{c}),$$

where $\det(\mathbf{d}, \mathbf{b}, \mathbf{c})$, which we will call $\Delta_1$, is simply the determinant $\Delta$
with column 1 (the column vector $\mathbf{a}$) replaced by the column vector $\mathbf{d}$.

Thus $\Delta x = \Delta_1$; and similarly we obtain

$$\Delta y = \Delta_2,$$

where $\Delta_2$ is $\Delta$ with its second column replaced by the column vector $\mathbf{d}$;
and finally,

$$\Delta z = \Delta_3.$$


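If we make use of the matrix $\mathbf{M} = \begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}$
and its adjoint matrix $\mathbf{M}^*$ (see 13.4), the foregoing proof can be considerably
shortened. For the original equations (1), (2), (3) can be written in
matrix form as

$$\begin{aligned} \mathbf{M}\mathbf{x} &= \mathbf{d} \\ \Rightarrow\ \mathbf{M}^*\mathbf{M}\mathbf{x} &= \mathbf{M}^*\mathbf{d} \\ \Rightarrow\ \Delta\mathbf{I}\mathbf{x} &= \mathbf{M}^*\mathbf{d} \\ \Rightarrow\ \Delta\mathbf{x} &= \mathbf{M}^*\mathbf{d} \end{aligned}$$

$$\Rightarrow\ \begin{pmatrix} \Delta x \\ \Delta y \\ \Delta z \end{pmatrix} = \begin{pmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{pmatrix} \begin{pmatrix} d_1 \\ d_2 \\ d_3 \end{pmatrix} \ \Rightarrow\ \begin{aligned} \Delta x &= A_1d_1 + A_2d_2 + A_3d_3 \\ \Delta y &= B_1d_1 + B_2d_2 + B_3d_3 \\ \Delta z &= C_1d_1 + C_2d_2 + C_3d_3 \end{aligned}$$

So, again, $\Delta x = \Delta_1$, $\Delta y = \Delta_2$, $\Delta z = \Delta_3$.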


This solution in determinant form is known as Cramer’s rule and is 
clearly capable of generalisation to any number of equations. But it 
suffers from the very serious disadvantage that evaluation of determinants 
is usually extremely tedious. 
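
For readers with access to a computer, the rule can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `det3` and `cramer3` are assumptions made here, and the sample system is the one whose determinants are evaluated in Example 1.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3 x 3 matrix, expanded along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def cramer3(M, d):
    """Solve M x = d by Cramer's rule; only valid in the regular case."""
    delta = det3(M)
    if delta == 0:
        raise ValueError("non-regular system: the determinant is zero")
    solution = []
    for col in range(3):
        # Replace column `col` of M by d, giving Delta_1, Delta_2, Delta_3.
        replaced = [[d[r] if c == col else M[r][c] for c in range(3)]
                    for r in range(3)]
        solution.append(Fraction(det3(replaced), delta))
    return solution


# Coefficients of x + y - z = 1, x - y + z = 2, 2x - z = 4.
M = [[1, 1, -1], [1, -1, 1], [2, 0, -1]]
d = [1, 2, 4]
x, y, z = cramer3(M, d)
print(x, y, z)
```

Exact fractions are used so that the quotients of determinants are not rounded.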
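Example 1


Solve the equations

$$\begin{aligned} x + y - z &= 1 \\ x - y + z &= 2 \\ 2x - z &= 4. \end{aligned}$$

Here

$$\Delta = \begin{vmatrix} 1 & 1 & -1 \\ 1 & -1 & 1 \\ 2 & 0 & -1 \end{vmatrix} = 1(1) - 1(-3) - 1(2) = 2$$

$$\Delta_1 = \begin{vmatrix} 1 & 1 & -1 \\ 2 & -1 & 1 \\ 4 & 0 & -1 \end{vmatrix} = 1(1) - 1(-6) - 1(4) = 3$$

$$\Delta_2 = \begin{vmatrix} 1 & 1 & -1 \\ 1 & 2 & 1 \\ 2 & 4 & -1 \end{vmatrix} = 1(-6) - 1(-3) - 1(0) = -3$$

$$\Delta_3 = \begin{vmatrix} 1 & 1 & 1 \\ 1 & -1 & 2 \\ 2 & 0 & 4 \end{vmatrix} = 1(-4) - 1(0) + 1(2) = -2$$

So $2x = 3$, $2y = -3$, $2z = -2$
$\Rightarrow\ x = \tfrac{3}{2}$, $y = -\tfrac{3}{2}$, $z = -1$


(as we could have found more simply by elimination!).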


Example 2

Solve the homogeneous equations

$$\begin{aligned} a_1x + b_1y + c_1z &= 0 \\ a_2x + b_2y + c_2z &= 0 \\ a_3x + b_3y + c_3z &= 0. \end{aligned}$$

Here it is immediately clear that

$$\Delta_1 = \Delta_2 = \Delta_3 = 0 \ \Rightarrow\ \Delta x = \Delta y = \Delta z = 0.$$

So if $\Delta \neq 0$, it follows that the equations have a unique set of solutions
$x = y = z = 0$; in other words, if there should exist a set of solutions
which are not all zero, then we may be sure that $\Delta = 0$.

Conversely (see exercise 13.6a, no. 8), it can be shown that if $\Delta = 0$,
then it is possible to find values of $x$, $y$, $z$ which are not all zero satisfying the
original equations.

So $\Delta = 0 \iff$ the existence of values $x$, $y$, $z$, not all zero, satisfying the homogeneous equations.

Regarding these three equations as representing three planes through
the origin $O$, we see that they also have some other point $P$ in common if,
and only if, they intersect in at least a line ($OP$), in which case their equations
are linearly dependent and $\Delta = 0$.

But this important result may also be reconsidered in several other 
ways: 

First, $\Delta = 0$ is the condition for the three straight lines $a_ix + b_iy + c_i = 0$
($i = 1, 2, 3$) to be concurrent (see 13.5, p. 425).

Second, $\Delta = |\mathbf{M}| = 0$ is also the condition for the matrix $\mathbf{M}$ to be singular
(see 13.4), that is, for the non-existence of an inverse.

Third, in the context of the system of non-homogeneous equations
$$\mathbf{M}\mathbf{x} = \mathbf{d} \qquad (\mathbf{d} \neq \mathbf{0}),$$
$\Delta = 0$ is the condition for the system to be non-regular, and this we shall deal
with in 13.7.

Fourth, $|\mathbf{M}|$ is the value of the scalar triple product $\mathbf{a}.(\mathbf{b} \times \mathbf{c})$ which
represents the volume of the parallelepiped formed by the vectors $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$
as adjacent edges (see 13.2). Hence $|\mathbf{M}| = 0$ is the condition for its volume
to vanish, and so for the vectors $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ to be linearly dependent. And,
similarly, it is also the condition for the three row-vectors
$(a_i, b_i, c_i)$ ($i = 1, 2, 3$) to be linearly dependent.


Example 3 


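It is also useful to see how Cramer's rule can be obtained by vector methods.
For equations (1), (2), and (3) above may be written in vector form as

$$x\mathbf{a} + y\mathbf{b} + z\mathbf{c} = \mathbf{d}.$$

Remembering that vector product distributes over addition, we take the
vector product of both sides with $\mathbf{b}$, giving

$$x\,\mathbf{a} \times \mathbf{b} + y\,\mathbf{b} \times \mathbf{b} + z\,\mathbf{c} \times \mathbf{b} = \mathbf{d} \times \mathbf{b}.$$

But $\mathbf{b} \times \mathbf{b} = \mathbf{0}$.
Hence $x\,\mathbf{a} \times \mathbf{b} + z\,\mathbf{c} \times \mathbf{b} = \mathbf{d} \times \mathbf{b}$.

Now take the scalar product of both sides with $\mathbf{c}$, and again remember
that scalar product distributes over addition.

Then $x(\mathbf{a} \times \mathbf{b}).\mathbf{c} + z(\mathbf{c} \times \mathbf{b}).\mathbf{c} = (\mathbf{d} \times \mathbf{b}).\mathbf{c}$.
But $(\mathbf{c} \times \mathbf{b}).\mathbf{c} = 0$.
Hence $x(\mathbf{a} \times \mathbf{b}).\mathbf{c} = (\mathbf{d} \times \mathbf{b}).\mathbf{c}$.

But $\mathbf{a} \times \mathbf{b} = (a_2b_3 - a_3b_2)\mathbf{i} + (a_3b_1 - a_1b_3)\mathbf{j} + (a_1b_2 - a_2b_1)\mathbf{k}$
and $\mathbf{c} = c_1\mathbf{i} + c_2\mathbf{j} + c_3\mathbf{k}$.

So
$$(\mathbf{a} \times \mathbf{b}).\mathbf{c} = (a_2b_3 - a_3b_2)c_1 + (a_3b_1 - a_1b_3)c_2 + (a_1b_2 - a_2b_1)c_3 = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} = \Delta;$$

and similarly,
$$(\mathbf{d} \times \mathbf{b}).\mathbf{c} = \begin{vmatrix} d_1 & b_1 & c_1 \\ d_2 & b_2 & c_2 \\ d_3 & b_3 & c_3 \end{vmatrix} = \Delta_1.$$

So $\Delta x = \Delta_1$, and similarly $\Delta y = \Delta_2$, $\Delta z = \Delta_3$.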


Exercise 13.6a 


1 Use Cramer’s rule to solve: 
(2) 2χ —-3Sy+3=0 (1) 2x — 3y ~1=0 
3x ty —11=0; 4x — by + 5 = O; 
(2111) ix + my + nz = 0 
mx — ly + pz = 0. 


2 Use Cramer's rule to solve the simultaneous equations:

(i) $x - z = 4$, $\quad 2x + y + 3z = -2$, $\quad 4x + 2y + 5z = -3$;

(ii) $4x - y - 2z = -7$, $\quad 3x + 5y - 6z = 4$, $\quad 6x - 13y + 6z = -29$.


3 Use Cramer's rule to solve for $\cos\theta$ and $\sin\theta$ in terms of $a$, $b$, $c$, $p$, $q$,
and $r$, and hence eliminate $\theta$:
$$\begin{aligned} a\cos\theta + b\sin\theta &= c \\ p\cos\theta + q\sin\theta &= r. \end{aligned}$$


4 Obtain the condition for the quadratic equations $ax^2 + bx + c = 0$,
$px^2 + qx + r = 0$ to have a common root by first solving these as a pair
of simultaneous equations in $x^2$ and $x$, using Cramer's rule, and then
eliminating $x$. (Compare exercises 13.5a, no. 8 and 13.5b, no. 10.)


5 Find the condition that the equations

$$\begin{aligned} ax + by + cz &= 0 \\ bx + cy + az &= 0 \\ cx + ay + bz &= 0 \end{aligned}$$

have a solution other than $(x, y, z) = (0, 0, 0)$.
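6 Find the values of $\lambda$ for which the equations
$$\begin{aligned} 4x - 6y - z &= \lambda x \\ x - 4y - z &= \lambda y \\ 2x + 3y + z &= \lambda z \end{aligned}$$
have a solution other than $x = y = z = 0$, and find the ratios $x:y:z$ for
each of these values of $\lambda$. (O.C.)


7 If $\omega$ denotes one of the cube roots of 1 other than 1 itself, solve the
equations

$$\begin{aligned} x + y + z &= a \\ x + \omega y + \omega^2 z &= b \\ x + \omega^2 y + \omega z &= c \end{aligned}$$

for $x$, $y$, $z$ in terms of $a$, $b$, $c$. Give your answer in as simple terms as possible. (S.M.P.)


8 Consider the three homogeneous equations

$$\begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}.$$

Prove that, if solutions other than $x = y = z = 0$ exist, then $\Delta = 0$ (the
3 × 3 determinant of the coefficients).

Prove conversely, that if $\Delta = 0$, then solutions other than the trivial ones
may be found.

(Note that $\Delta x = \Delta_1 = 0$, etc., and that if $\Delta = 0$, then any arbitrary
value of $x$ will satisfy. Note also that

$$\begin{pmatrix} a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix} \text{ are satisfied by } \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} A_1 \\ B_1 \\ C_1 \end{pmatrix} \text{ in any case.)}$$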


9 Prove that if $\alpha$, $\beta$ are real numbers and $\alpha \neq 0$, then the equations

$$\begin{aligned} x + y + 2z &= 5 \\ 2x + y + z &= 2 \\ 3x + y + \alpha z &= \beta \end{aligned}$$

have a unique solution. Prove further, that if $\alpha = 0$, then these equations
have a solution only if $\beta$ takes one particular value, which should be found.
Find all the solutions of the equations in this case. (O.S.)

10 Given that $k$ is a real constant, solve completely, when possible, the
simultaneous equations 
kx + 2y + 82 = 0 

Ke ayo 2hz = 0 

x+y + 6z = 2k 

discussing carefully any special cases. (O.S.) 
11 Three planes through the origin have equations
$$\begin{aligned} \Pi_1:\quad ax + hy + gz &= 0 \\ \Pi_2:\quad hx + by + fz &= 0 \\ \Pi_3:\quad gx + fy + cz &= 0. \end{aligned}$$


Let l,,7 = 1, 2, 3, be the three planes defined as follows: I’, is the plane 
through the intersection of I], and II,, and also containing the x-axis. 
I, and I, are similarly defined. Prove that ’,, 12, 14 have a common 
point. 


Regular System — ‘ Ill-conditioned’ case 


We know that in the case $\Delta \neq 0$ our original system of equations will have
a unique solution. Note, however, that if $\Delta$ is very small compared with
$\Delta_1$, $\Delta_2$, and $\Delta_3$, the values of $x$, $y$, $z$ may be large: the point of intersection
of the three planes is a long way off, if we think in terms of the first
interpretation of p. 418; or the matrix is nearly singular, in the second
interpretation; or the 3 vectors $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ are very nearly coplanar in the case of the
third.

In such a situation, the equations (and the matrix M) are described as 
ill-conditioned. This means that the solution is very sensitive to changes in 
the data: move one of our three planes by a very slight alteration of perhaps 
just one of its coefficients, and the common point of the three planes may be 
moved a long way. 
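
The sensitivity just described can be seen in a small numerical experiment. The sketch below is only an illustration under stated assumptions: the 2 × 2 system $x + y = 2$, $x + (1 + \varepsilon)y = 3$ and the helper name `solve2` are invented for the purpose and do not come from the text.

```python
from fractions import Fraction


def solve2(a1, b1, c1, a2, b2, c2):
    """Solve a1 x + b1 y = c1, a2 x + b2 y = c2 by Cramer's rule, exactly."""
    delta = Fraction(a1) * b2 - Fraction(a2) * b1
    if delta == 0:
        raise ValueError("non-regular system")
    x = (Fraction(c1) * b2 - Fraction(c2) * b1) / delta
    y = (Fraction(a1) * c2 - Fraction(a2) * c1) / delta
    return x, y


# Hypothetical nearly parallel lines: the determinant equals eps.
for eps in (Fraction(1, 100), Fraction(1, 200)):
    x, y = solve2(1, 1, 2, 1, 1 + eps, 3)
    print(eps, x, y)
```

Altering one coefficient by only 1/200 moves the point of intersection by about 100 units in each coordinate, which is the behaviour described as ill-conditioned.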


The singular case, in terms of Cramer’s rule 


In a non-regular set of equations, where $\Delta = 0$, we know that no finite
solutions are possible in $x$, $y$, $z$ unless all three of the determinants $\Delta_1$, $\Delta_2$,
$\Delta_3$ are zero. This may happen, of course, by having $\mathbf{d} = \mathbf{0}$, i.e., $d_1 = 0$,
$d_2 = 0$, and $d_3 = 0$.

In the more general case when $\mathbf{d} \neq \mathbf{0}$, it may be wondered whether it
is possible for $\Delta_1$ to be zero without $\Delta_2$ or $\Delta_3$ necessarily being zero. At
first it seems that this is not possible, for one may argue as follows:

Firstly, $\Delta = 0$ implies that $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ are linearly dependent,

$\Delta_1 = 0$ implies that $\mathbf{d}$, $\mathbf{b}$, $\mathbf{c}$ are linearly dependent.

Hence it appears that $\mathbf{a}$, $\mathbf{b}$, $\mathbf{d}$ are linearly dependent, so that $\Delta_3$ is also
zero, and similarly $\Delta_2 = 0$.

Alternatively, we may think of the three planes represented by the three
linear equations. Now since in any case $\Delta = 0$, we know the three planes
either have a common line or else form a triangular prism, including the
case when some pairs are parallel. Consider now the lines in which the
planes intersect the plane $z = 0$.
They have equations

$$\begin{aligned} a_1x + b_1y &= d_1 \\ a_2x + b_2y &= d_2 \\ a_3x + b_3y &= d_3 \end{aligned}$$

and the vanishing of the determinant $\Delta_3 = \begin{vmatrix} a_1 & b_1 & d_1 \\ a_2 & b_2 & d_2 \\ a_3 & b_3 & d_3 \end{vmatrix}$
ensures that these three lines have a common point (taking the more
general case when they are not all three parallel). In such conditions, the
three planes do not form a triangular prism, but will have a common line.
This means that they also cut the plane $y = 0$ in three concurrent lines,
whose equations are

$$\begin{aligned} a_1x + c_1z &= d_1 \\ a_2x + c_2z &= d_2 \\ a_3x + c_3z &= d_3 \end{aligned}$$

and whose determinant should therefore vanish. This would give the result
$\Delta_2 = 0$, and similarly we should have $\Delta_1 = 0$.

However, the above arguments overlook the possibility that $\Delta$ and $\Delta_1$
vanish as a result of the two vectors $\mathbf{b}$ and $\mathbf{c}$ being linearly dependent. In
such a case, $\mathbf{a}$ and $\mathbf{d}$ can be freely chosen as any two vectors, so it would
clearly be possible to avoid the vanishing of either $\Delta_2$ or $\Delta_3$, or of them both.


Exercise 13.6b 


1 Consider the lines
$$\begin{aligned} 2x + 3y - 4 &= 0 \\ 6x + ky - 2 &= 0 \end{aligned}$$
in the case when $k = 9$, and when $k = 9 + \delta$, where $\delta$ is very small.


2 Find $k$ so that the planes

$$\begin{aligned} 2x - 3y + z &= c \\ 3x - y &= -4 \\ x + ky + 2z &= 3 \end{aligned}$$

may form a triangular prism in general.

When $k$ has this value, find $c$ so that the planes may have a common line,
and check that, in this case, all four of the 3 × 3 determinants of the 3 × 4
matrix (the 'augmented matrix' of the set of equations) are zero.

In this case, find constants $p$, $q$, and $r$, such that:

$$p(2\ \ {-3}\ \ 1\ \ c) + q(3\ \ {-1}\ \ 0\ \ {-4}) + r(1\ \ k\ \ 2\ \ 3) = (0\ \ 0\ \ 0\ \ 0).$$

In the case $c = 1$, find the effect of changing $k$ slightly from its critical
value.
3 (i) Find the values of $\lambda$ and $\mu$ for which the equations
Ax +y =0 
Me = a 
2x + 3y — 2z = 0 
have a solution other than the trivial solution x = y = z = 0. 


(ii) For what values of $\lambda$ do the equations

$$\begin{aligned} \lambda x + y &= 1 \\ x + \lambda y - z &= 1 \\ 2x + 2y - 2z &= 1 \end{aligned}$$

have a solution? (O.S.)


4 Find a vector which is perpendicular to each of the following pairs: 
Go 2 1. 9. ands == 2> 10° “ays 

(7) (1 4 —1) and (—-1 2 QO); 

(111) (cosa sina O) and (0, cosf sin f). 


*13.7 Systematic reduction 


Three linear equations with three unknowns 
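We now deal with the case of three simultaneous linear equations in the form

(i)
$$\begin{aligned} a_1x + b_1y + c_1z &= d_1 \\ a_2x + b_2y + c_2z &= d_2 \\ a_3x + b_3y + c_3z &= d_3 \end{aligned}$$

or (ii)
$$\begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} d_1 \\ d_2 \\ d_3 \end{pmatrix}; \qquad \mathbf{M}\mathbf{x} = \mathbf{d}$$

or (iii)
$$x \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix} + y \begin{pmatrix} b_1 \\ b_2 \\ b_3 \end{pmatrix} + z \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = \begin{pmatrix} d_1 \\ d_2 \\ d_3 \end{pmatrix}; \qquad x\mathbf{a} + y\mathbf{b} + z\mathbf{c} = \mathbf{d}$$

Let us consider now the threefold† interpretation in this case, just as we
did with two equations (see 13.4).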


In the first place, as an equation such as $2x - y + 5z = -3$ connecting
the coordinates $(x, y, z)$ of a point in three-dimensional space represents a
plane, our first interpretation concerns three planes in space; and in the
general case (when the equations are regular), we expect there to exist a
unique point common to the three planes, as for example the three planes
which intersect at the ('trihedral') vertex of a cube. The non-regular
cases, when there is something unusual, will be considered presently.

† In due course it will become apparent that the second and third interpretations are really
the same, but expressed differently. The similarity of these to the first interpretation is not
however so apparent, cf. 13.4, p. 417.


In the second interpretation, we require to find which vector is transformed 
by the matrix M into the given vector d, and in the general case this prob- 
lem again has a unique solution. 


Thirdly, we are attempting to find what multiples of the given vectors
$\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ must be 'mixed' together in order to obtain the vector $\mathbf{d}$; and if all
is well, there will again be a unique answer.

Whenever the matrix $\mathbf{M}$ is non-singular, i.e., possesses an inverse, the
unique solution is applicable. For then

$$\mathbf{M}\mathbf{x} = \mathbf{d} \ \Rightarrow\ \mathbf{M}^{-1}\mathbf{M}\mathbf{x} = \mathbf{M}^{-1}\mathbf{d},$$

so that $\mathbf{x} = \mathbf{M}^{-1}\mathbf{d}$ is the unique solution.

The question of actually finding the inverse of a 3 X 3 matrix was dis- 
cussed in 13.4, and a second method will emerge in this section (p. 455). 
But in numerical cases, even with the facility of a computer, it is usually 
simpler to use another method, rather than to compute the inverse. 
Similarly, a problem in mechanics might lead to the following set of
equations, in which $T$, $S$ represent unknown tensions, while $a$, $b$, and $c$
represent unknown accelerations:


mg-S=m,a; mg -- T= moc; ¢ = 2a — 2b; 
msg — 21 = mzb; 5 —2T = mya. 


In such a case, the values of the unknowns are best found by 'common-sense'
elimination; it would be absurd to force the problem into the
format of a 5 × 5 matrix with a very large number of zeros.


Echelon form 

Suppose we are asked to express

$$2r^3 - 5r^2 - 3r + 11 \quad\text{in the form}\quad Ar(r + 1)(r + 2) + Br(r + 1) + Cr + D.$$

Put
$$\begin{aligned} r = 0, &\qquad 11 = D &(1) \\ r = -1, &\qquad 7 = -C + D &(2) \\ r = -2, &\qquad -19 = 2B - 2C + D &(3) \\ r = 1, &\qquad 5 = 6A + 2B + C + D &(4) \end{aligned}$$

Equation (1) gives immediately $D = 11$; substitution into (2) gives $C = 4$.
Knowing $C$ and $D$, (3) gives $B = -11$, and finally (4) enables us to find
$A = 2$.

Thus $2r^3 - 5r^2 - 3r + 11 = 2r(r + 1)(r + 2) - 11r(r + 1) + 4r + 11$.
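Now these equations were particularly easy to solve because of the form of
the 4 × 4 matrix

$$\begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 1 \\ 0 & 2 & -2 & 1 \\ 6 & 2 & 1 & 1 \end{pmatrix} \begin{pmatrix} A \\ B \\ C \\ D \end{pmatrix} = \begin{pmatrix} 11 \\ 7 \\ -19 \\ 5 \end{pmatrix}$$

where it will be noticed that there is a triangle of zeros above the N.E.-S.W.
diagonal. Such a matrix is said to be in triangular, or échelon, form. Had we
written the four equations in reverse order, we should have had

$$\begin{pmatrix} 6 & 2 & 1 & 1 \\ 0 & 2 & -2 & 1 \\ 0 & 0 & -1 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} A \\ B \\ C \\ D \end{pmatrix} = \begin{pmatrix} 5 \\ -19 \\ 7 \\ 11 \end{pmatrix}$$

The latter is a more usual form for triangular matrices: we prefer to have
all the zeros either (as in this case) below the leading diagonal (N.W. to S.E.)
or above it, thus:

$$\begin{pmatrix} \times & \cdot & \cdot & \cdot \\ 0 & \times & \cdot & \cdot \\ 0 & 0 & \times & \cdot \\ 0 & 0 & 0 & \times \end{pmatrix} \qquad\qquad \begin{pmatrix} \times & 0 & 0 & 0 \\ \cdot & \times & 0 & 0 \\ \cdot & \cdot & \times & 0 \\ \cdot & \cdot & \cdot & \times \end{pmatrix}$$

(lower diagonal form) (upper diagonal form)

(leading diagonal marked × in each case)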
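
The step-by-step evaluation used above, starting from the last equation and working upwards, can be sketched as a short routine. This is a minimal sketch, assuming a matrix with zeros below the leading diagonal and non-zero diagonal entries; the name `back_substitute` is an assumption made for illustration.

```python
from fractions import Fraction


def back_substitute(U, rhs):
    """Solve U x = rhs when U has zeros below the leading diagonal."""
    n = len(U)
    x = [Fraction(0)] * n
    for i in range(n - 1, -1, -1):
        # Contribution of the unknowns that have already been found.
        known = sum(U[i][j] * x[j] for j in range(i + 1, n))
        x[i] = Fraction(rhs[i] - known) / U[i][i]
    return x


# The four equations for A, B, C, D written in reverse order.
U = [[6, 2, 1, 1],
     [0, 2, -2, 1],
     [0, 0, -1, 1],
     [0, 0, 0, 1]]
rhs = [5, -19, 7, 11]
A, B, C, D = back_substitute(U, rhs)
print(A, B, C, D)
```

Each unknown costs only one division once the later ones are known, which is why the triangular form is so convenient.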


Reduction to échelon form 


If we start with a set of linear equations, we may attempt their solution
by trying to obtain from them an equivalent set of equations whose matrix is
triangular. The evaluation of the several unknowns will then be a simple
matter, as in the example above. The method of reduction to such an
échelon form resembles the method used in elementary algebra:


Example 1


Solve the equations

$$\begin{aligned} x - z &= 4 &(1) \\ 2x + y + 3z &= -2 &(2) \\ 4x + 2y + 5z &= -3 &(3) \end{aligned}$$

Multiply equation (1) by 2, and subtract from equation (2).
Also, multiply equation (1) by 4, and subtract from equation (3).
These operations we shall abbreviate respectively: (2) − 2 × (1),
and: (3) − 4 × (1).

Then
$$\begin{aligned} x - z &= 4 &(1) \\ y + 5z &= -10 &(5) = (2) - 2 \times (1) \\ 2y + 9z &= -19 &(6) = (3) - 4 \times (1) \end{aligned}$$

$$\begin{aligned} x - z &= 4 &(1) \\ y + 5z &= -10 &(5) \\ -z &= 1 &(7) = (6) - 2 \times (5) \end{aligned}$$

The equations are now in échelon form, and by 'going up the ladder'
(échelon = ladder), we obtain successively $z = -1$, $y = -5$, $x = 3$. In
this particular example, there was a short cut, which we have ignored,
namely (3) − 2 × (2), which leads to $z = -1$ straight away, but we must
be prepared for less fortunate cases!

Reminding ourselves of the notation introduced in 13.5, the working
can now be condensed:

                        1   0  -1 |   4
    r2' = r2 - 2r1      0   1   5 | -10
    r3' = r3 - 4r1      0   2   9 | -19

                        1   0  -1 |   4
                        0   1   5 | -10
    r3' = r3 - 2r2      0   0  -1 |   1

corresponding to equations (1), (5) and (7) above.


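We note the three interpretations of the solution $(x, y, z) = (3, -5, -1)$:
(i) the three planes intersect at the point with coordinates $(3, -5, -1)$;

or (ii) in the linear transformation of three-dimensional space represented
by the matrix

$$\begin{pmatrix} 1 & 0 & -1 \\ 2 & 1 & 3 \\ 4 & 2 & 5 \end{pmatrix}, \text{ the vector } \begin{pmatrix} 4 \\ -2 \\ -3 \end{pmatrix} \text{ arises from the vector } \begin{pmatrix} 3 \\ -5 \\ -1 \end{pmatrix};$$

or (iii) letting $\mathbf{a} = \begin{pmatrix} 1 \\ 2 \\ 4 \end{pmatrix}$, $\mathbf{b} = \begin{pmatrix} 0 \\ 1 \\ 2 \end{pmatrix}$, $\mathbf{c} = \begin{pmatrix} -1 \\ 3 \\ 5 \end{pmatrix}$, $\mathbf{d} = \begin{pmatrix} 4 \\ -2 \\ -3 \end{pmatrix}$,
we have found that
$$\mathbf{d} = 3\mathbf{a} - 5\mathbf{b} - \mathbf{c},$$

so that $\mathbf{d}$ can be expressed as a linear combination of $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$.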
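
The reduction of Example 1 can be mechanised. What follows is a sketch only, in Python, assuming that no zero appears in a pivot position (otherwise the rows would first have to be reordered); the function names are assumptions made for illustration.

```python
from fractions import Fraction


def reduce_to_echelon(rows):
    """Forward elimination on an augmented matrix (zeros below the diagonal)."""
    rows = [[Fraction(v) for v in r] for r in rows]
    n = len(rows)
    for p in range(n):
        if rows[p][p] == 0:
            raise ValueError("zero pivot: the rows would need reordering")
        for r in range(p + 1, n):
            factor = rows[r][p] / rows[p][p]
            # Row operation r' = r - factor * (pivot row).
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[p])]
    return rows


def go_up_the_ladder(rows):
    """Back substitution on the echelon form, last equation first."""
    n = len(rows)
    x = [Fraction(0)] * n
    for i in range(n - 1, -1, -1):
        known = sum(rows[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (rows[i][n] - known) / rows[i][i]
    return x


augmented = [[1, 0, -1, 4],
             [2, 1, 3, -2],
             [4, 2, 5, -3]]
echelon = reduce_to_echelon(augmented)
solution = go_up_the_ladder(echelon)
print(echelon, solution)
```

The multipliers chosen by the routine for this system (2, 4 and then 2) are those of the hand working.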


Example 2

$$\begin{aligned} 4x - y - 2z &= -7 \\ 3x + 5y - 6z &= 4 \\ x - 3y - 4z &= -9. \end{aligned}$$

We show the matrix of coefficients at successive stages, and on the right
give the values of the row sums (see below).

                          4   -1   -2 |   -7 |  -6
                          3    5   -6 |    4 |   6
                          1   -3   -4 |   -9 | -15

    r1' = 2r1 - r3        7    1    0 |   -5 |   3
                          3    5   -6 |    4 |   6
                          1   -3   -4 |   -9 | -15

This time we are going to obtain an 'upper' triangular form.

                          7    1    0 |   -5 |   3
    r2' = 2r2 - 3r3       3   19    0 |   35 |  57
                          1   -3   -4 |   -9 | -15

It remains to get a zero as the second element in the first row without
disturbing the two zeros already obtained.

    r1' = 19r1 - r2     130    0    0 | -130 |   0
                          3   19    0 |   35 |  57
                          1   -3   -4 |   -9 | -15

The equations are now in échelon form:
$$\begin{aligned} 130x &= -130 \\ 3x + 19y &= 35 \\ x - 3y - 4z &= -9 \end{aligned}$$

and so $x = -1$, $y = 2$, $z = \tfrac{1}{2}$.

Note that $r_1' = 2r_1 - r_3$ is equivalent to the two steps $r_1' = 2r_1$ (i.e.,
multiply throughout equation (1) by 2), followed by $r_1'' = r_1' - r_3$. We
might have used $r_1' = r_1 - \tfrac{1}{2}r_3$, but this would have caused the work to
become encumbered by fractions. On the right, a tally is shown of row
sums, providing a useful check on accuracy, which is always used in
practice. For example, when we applied $r_2' = 2r_2 - 3r_3$, the sum of the
newly formed second row (57) does satisfy the relation:

$$57 = 2 \times 6 - 3(-15).$$


Exercise 13.7a 


1 Solve the following sets of equations using the method of the above 
section, showing the row (or column) operations, and keeping a check of 
row sums: 


(1) 2x + 2y -- 3z=0 (11) 2χ -- 7y+4z= 2 
3x + 2y -- 10z = 7 4x + 2z = —3 
3x + 4y + z= 8; IS ey eee 

(1) xt yt z= 1 


1 2 34h yJ=Ht_—-1 43 
1 3 -1/ \z Ι 
(v) 4 6 5\ Λα 17 
—3 8 144, y],=[ 67 
9 =4 2/ \z —8 


2 Find numbers f, 4, r such that: 
‘T\ -} ‘] ] 
ρΡ 1} Ἐφ Lytr] 2|Ξ{0. 
2 =] 3 Z 
3 Solve the homogeneous equations for the ratios x:y:z:w: 


Lae PS Oa ev 
2x — 3y + 4z — 8w = 0 
x + 2y + 5z + 3w = 0. 


4 Let A stand for the matrix 


17 —4 0 
23 49 3 
ι 13 1 


Find two matrices $\mathbf{M}_1$, $\mathbf{M}_2$ such that:
(i) each has only one non-zero entry off the leading diagonal;
(ii) the product $\mathbf{M}_1\mathbf{M}_2\mathbf{A}$ has zeros above and to the right of the leading
diagonal, so that its final form is

$$\begin{pmatrix} \cdot & 0 & 0 \\ \cdot & \cdot & 0 \\ \cdot & \cdot & \cdot \end{pmatrix}$$

where the places indicated by the dots are occupied by numbers.
From your result, deduce the value of the determinant of A. (S.M.P.) 


5 Describe a method of finding the solution of three simultaneous linear
equations and illustrate your answer by solving the system:

$$\begin{aligned} 3.9x + 2.5y - 0.5z &= 4.3 \\ 5.2x + 2.9y - 1.6z &= 10.9 \\ 2.6x + 1.8y - 0.7z &= 2.8 \end{aligned}$$

working to three significant figures. (M.E.I.)


6 Use the method of pivotal condensation,† with a sum check, or any
other suitable method, to find the values of $x$, $y$, $z$ from the following
simultaneous equations. Tabulate the details of your working, and give
results to three significant figures (you may use a slide rule).

$$\begin{aligned} 9.37x + 3.04y - 2.44z &= 9.23 \\ 3.04x + 6.18y + 1.22z &= 8.20 \\ -2.44x + 1.22y + 8.44z &= 3.93. \end{aligned}$$

Find the residual when your results are substituted in the second equation.
Comment on the value of this residual as a guide to the accuracy of your
results. (M.E.I.)


Identity matrix by using row operations 


It is possible to proceed still further in this process. Reverting to Example 1,
and resuming from the point we reached when triangular form was
achieved (with the row sums shown on the right):

                        1   0  -1 |   4 |   4
                        0   1   5 | -10 |  -4
                        0   0  -1 |   1 |   0

    r1' = r1 - r3       1   0   0 |   3 |   4
                        0   1   5 | -10 |  -4
                        0   0  -1 |   1 |   0

                        1   0   0 |   3 |   4
    r2' = r2 + 5r3      0   1   0 |  -5 |  -4
                        0   0  -1 |   1 |   0

Finally

                        1   0   0 |   3 |   4
                        0   1   0 |  -5 |  -4
    r3' = -r3 gives     0   0   1 |  -1 |   0

and it will be seen that we have effectively succeeded in setting up the
identity matrix on the left-hand side of the equation. However, it is not
necessary to go beyond the point where the échelon form is obtained.

† i.e. by reduction to échelon form.

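It is instructive to follow through the above working using elementary
matrices. In Example 1, the successive operations

$$r_2' = r_2 - 2r_1, \qquad r_3' = r_3 - 4r_1 \qquad\text{and}\qquad r_3' = r_3 - 2r_2$$

are carried out by using

$$\begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ -4 & 0 & 1 \end{pmatrix} \qquad\text{and}\qquad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix}$$

as pre-multipliers. Their combined effect is the product

$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ -4 & 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix};$$

and when this pre-multiplies the original matrix, we obtain

$$\begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 & -1 & 4 \\ 2 & 1 & 3 & -2 \\ 4 & 2 & 5 & -3 \end{pmatrix} = \begin{pmatrix} 1 & 0 & -1 & 4 \\ 0 & 1 & 5 & -10 \\ 0 & 0 & -1 & 1 \end{pmatrix}$$

corresponding to the result above.

Thus the matrix $\begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix}$ pre-multiplying $\begin{pmatrix} 1 & 0 & -1 \\ 2 & 1 & 3 \\ 4 & 2 & 5 \end{pmatrix}$
changes it into lower triangular form.

Proceeding with the three succeeding transformations,

$$r_1' = r_1 - r_3, \qquad r_2' = r_2 + 5r_3, \qquad r_3' = -r_3,$$

their combined effect is given in a similar way by the product of the elementary
matrices:

$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 5 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 & -1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & -1 \\ 0 & 1 & 5 \\ 0 & 0 & -1 \end{pmatrix}$$

The combined effect of all six matrices is given by

$$\begin{pmatrix} 1 & 0 & -1 \\ 0 & 1 & 5 \\ 0 & 0 & -1 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 0 & -2 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 2 & -1 \\ -2 & -9 & 5 \\ 0 & 2 & -1 \end{pmatrix}$$

This, pre-multiplying the original equation, gives:

$$\begin{pmatrix} 1 & 2 & -1 \\ -2 & -9 & 5 \\ 0 & 2 & -1 \end{pmatrix} \begin{pmatrix} 1 & 0 & -1 & 4 \\ 2 & 1 & 3 & -2 \\ 4 & 2 & 5 & -3 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 & 3 \\ 0 & 1 & 0 & -5 \\ 0 & 0 & 1 & -1 \end{pmatrix}$$

It will therefore be seen that, by making use of the six elementary matrices
we have succeeded in showing that

$$\begin{pmatrix} 1 & 2 & -1 \\ -2 & -9 & 5 \\ 0 & 2 & -1 \end{pmatrix} \text{ is the inverse of } \begin{pmatrix} 1 & 0 & -1 \\ 2 & 1 & 3 \\ 4 & 2 & 5 \end{pmatrix}.$$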

Formation of inverse matrix by row operations 


The above process gives a clue to the formation of the inverse of any non-singular
3 × 3 matrix, starting from the unit matrix and applying the
appropriate row operations in turn. These are shown in the centre, with
the corresponding matrices on each side:

                    1  0 -1                          1  0  0
    original        2  1  3                          0  1  0       unit
    matrix          4  2  5                          0  0  1       matrix

                    1  0 -1                          1  0  0
                    0  1  5     r2' = r2 - 2r1      -2  1  0
                    4  2  5                          0  0  1

                    1  0 -1                          1  0  0
                    0  1  5                         -2  1  0
                    0  2  9     r3' = r3 - 4r1      -4  0  1

                    1  0 -1                          1  0  0
    échelon         0  1  5                         -2  1  0
    form            0  0 -1     r3' = r3 - 2r2       0 -2  1

                    1  0  0     r1' = r1 - r3        1  2 -1
                    0  1  5                         -2  1  0
                    0  0 -1                          0 -2  1

                    1  0  0                          1  2 -1
                    0  1  0     r2' = r2 + 5r3      -2 -9  5
                    0  0 -1                          0 -2  1

                    1  0  0                          1  2 -1
    unit            0  1  0                         -2 -9  5       inverse
    matrix          0  0  1     r3' = -r3            0  2 -1       matrix

In the process of forming the inverse, it is not necessary to pass through 
the échelon stage, so long as one contrives to produce zeros in the correct 
positions. Moreover, the process involving row operations could be replaced 
by one using column operations, but row and column operations must not be 
mixed, for column operations are equivalent to post-multiplication by 
elementary matrices. 

This method of inverting matrices, by using either row operations or
column operations, is basically the one used in practice, e.g., by computers
in the case of larger matrices, the elements of which are possibly awkward
numbers. Even using a computer, the method is a great deal shorter† than
that of forming the adjoint matrix, which is chiefly of theoretical interest.
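
As an illustration only, the row-operation method might be programmed along the following lines. The sketch applies every row operation simultaneously to the given matrix and to the unit matrix; the function name `invert` is an assumption, and, unlike the hand working above, the routine scales each pivot row to 1 as it goes and will interchange rows when a zero pivot is met.

```python
from fractions import Fraction


def invert(M):
    """Invert a square matrix by row operations on the pair (M | I)."""
    n = len(M)
    left = [[Fraction(v) for v in row] for row in M]
    right = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for p in range(n):
        # Find a row at or below p with a non-zero entry in column p.
        pivot = next((r for r in range(p, n) if left[r][p] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        left[p], left[pivot] = left[pivot], left[p]
        right[p], right[pivot] = right[pivot], right[p]
        scale = left[p][p]
        left[p] = [v / scale for v in left[p]]
        right[p] = [v / scale for v in right[p]]
        for r in range(n):
            if r != p and left[r][p] != 0:
                f = left[r][p]
                left[r] = [a - f * b for a, b in zip(left[r], left[p])]
                right[r] = [a - f * b for a, b in zip(right[r], right[p])]
    return right


inverse = invert([[1, 0, -1], [2, 1, 3], [4, 2, 5]])
print(inverse)
```

Only row operations are used throughout, in keeping with the warning that row and column operations must not be mixed.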


Exercise 13.7b 


1 Find the inverses of the following square matrices, using the method of 
the preceding section: 


4 3 3 7) --ῦ oie {4 θ 
a {ἢ ee 


2... ἘΞῸ 2—l1 4 Eck 
(vw) {3 2 —104; (v) 4 0 24; (vw) 1 —-2 14; 
3 4 Ι ὅτ ποῦν ὦ 2. sor2 
ΠῚ - 4 6 9 
(υ1}}} 2 4; (vw) {-3 8 14 
a 9-4 2 


† See Neill, H. and Moakes, A. J., Vectors, Matrices and Linear Equations, Oliver and Boyd,
1967, p. 157.

2 Use the results of no. 1 above to solve the equations in exercise 13.7a,
no. 1.
(Use row operations in most cases, but try using column operations in
others.)

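3 Find the inverse of the matrix $\mathbf{M} = \begin{pmatrix} 1 & -1 & 1 \\ 3 & -9 & 5 \\ 1 & -3 & 3 \end{pmatrix}$.

Rewrite the set of equations
$$\begin{aligned} x - y + z &= 3 \\ 3x - 9y + 5z &= 6 \\ x - 3y + 3z &= 13 \end{aligned}$$
in a form using the matrix $\mathbf{M}$, and hence solve the equations. (M.E.I.)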


4 Use the method of this section to find the inverses of some of the matrices 
in exercise 13.4a. 


5 Construct some examples of singular 3 x 3 matrices: 

(i) by arranging for the 3 × 3 determinant to have the value zero.

(ii) by arranging the third row to be linearly dependent on the first two
rows (i.e., $r_3 = ar_1 + br_2$, see p. 430).


6 Express the following simultaneous linear equations in matrix form 
x+ yt2z= 4 X+ Y+Z= 3x 

y — 3z = -- A= 2Y 4+ 2 = 3y 
28 Oy = OZ k Y—Z= z 
Find the value of & for which they are consistent, and obtain the general 
solution for X, Y, Z in that case. Give a geometrical illustration. (M.E.1.) 


I 


7 Solve where possible the equations 


LY «Ὁ 2%, Τὰ a 
2 Al 21 y |= ὁ 
Ὡς —3 —16/ \z ¢ 
a 0 Ι Ἵ 
inthe cases |b] = (2) | OF, (1)[1], (az) | OF, (w) [1] 
C 0 ᾿ Ι0 Ι0 
᾿Ξ; 50 
8 What is the effect of the singular matrix [3 1 —1]} onthe points 
πε 9 


x 


y | of 3-D space? 
9 Find a 3 × 3 matrix which takes all the points of 3-D space into the
plane $2x - 4y + z = 0$.


10 Find a 3 × 3 matrix which takes all the points of 3-D space into the
line of intersection of the planes $2x - 4y + z = 0$ and $x + y - 4z = 0$.


11 Show that the matrix $\begin{pmatrix} 2 & -3 & 1 \\ 4 & -6 & 2 \\ -3 & 4\tfrac{1}{2} & -1\tfrac{1}{2} \end{pmatrix}$
takes all the points of 3-D space into the line $x:y:z = 2:4:-3$. What do
you notice about the (i) minors, (ii) rows and columns?


*13.8 Further applications of matrices 


There are many applications of the use of matrices throughout mechanics 
and other branches of applied mathematics but we shall concentrate our 
attention on a group of problems involving the impact and collisions of a 
particle. 


Example 1 


The bouncing of a particle on an inclined plane 


Consider a projectile launched from a point $O$, so as to land on a plane
through $O$, inclined to the horizontal at an angle $\alpha$. If the magnitude and
direction of the initial velocity are known, it is possible to calculate the
range $r$, that is the distance between the point $O$ and the point $O_1$ where
the projectile hits the plane. It is convenient to work with the components
of the initial velocity along the plane and normal to it. Calling these $u$ and $v$,
it can be shown that

$$r = \frac{2v}{g\cos\alpha}(u - v\tan\alpha).$$

Of course, if $u < v\tan\alpha$, $r$ would be negative, and you should have no
difficulty in appreciating the meaning in this case.

Next suppose that the ball bounces after landing at O, after which it 
proceeds along another parabolic trajectory under gravity, and again hits 
the plane at a point O,, whereupon it rebounds and the process continues. 
We may wish to compute the position of the successive bounces, and to
know just what happens in the subsequent motion. 

The key to the problem is that we should like to be able to obtain the 
components (along and perpendicular to the plane) of the velocity of the 
ball as it sets out on each of its respective trajectories. For, once we know 
these, the above formula enables us to find where it lands at the end of that 
trajectory. 

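If the velocity just after the first bounce has components $u_1$, $v_1$, it is
easy to show that

$$\begin{aligned} u_1 &= u - 2v\tan\alpha \\ v_1 &= ev, \end{aligned}$$

where it is assumed that the plane is smooth, and that there is a coefficient
of restitution $e$ ($0 < e \le 1$) governed by the elastic properties of the
materials. These equations may be written

$$\begin{pmatrix} u_1 \\ v_1 \end{pmatrix} = \begin{pmatrix} 1 & -2\tan\alpha \\ 0 & e \end{pmatrix} \begin{pmatrix} u \\ v \end{pmatrix}$$

so that we see the matrix $\mathbf{M} = \begin{pmatrix} 1 & -2\tan\alpha \\ 0 & e \end{pmatrix}$ transforms the initial
velocity represented by a column vector into the velocity vector following
the bounce. This we may write $\mathbf{v}_1 = \mathbf{M}\mathbf{v}$. Similarly, if $\mathbf{v}_2 = \begin{pmatrix} u_2 \\ v_2 \end{pmatrix}$ is the
velocity immediately after the second bounce, we have $\mathbf{v}_2 = \mathbf{M}\mathbf{v}_1$. Combining
these, $\mathbf{v}_2 = \mathbf{M}\mathbf{M}\mathbf{v} = \mathbf{M}^2\mathbf{v}$, giving the velocity after the second
bounce in terms of the initial velocity. Similarly, $\mathbf{v}_3 = \mathbf{M}^3\mathbf{v}$ is the velocity
after the third bounce, and $\mathbf{v}_n = \mathbf{M}^n\mathbf{v}$ is the velocity after the $n$th rebound.

Thus, to find out what happens to the ball after repeated rebounds from
the plane, we need to be able to multiply the matrix $\begin{pmatrix} 1 & -2\tan\alpha \\ 0 & e \end{pmatrix}$
by itself repeatedly, i.e., we need to find $\mathbf{M}^2$, $\mathbf{M}^3$, $\mathbf{M}^4$, ..., or 'powers'
of the matrix $\mathbf{M}$. As an example, consider first the case when $\tan\alpha = \tfrac{1}{2}$,
$e = 1$ (perfect elasticity).

Then
$$\mathbf{M} = \begin{pmatrix} 1 & -1 \\ 0 & 1 \end{pmatrix}, \qquad \mathbf{M}^2 = \begin{pmatrix} 1 & -2 \\ 0 & 1 \end{pmatrix}, \qquad \mathbf{M}^3 = \begin{pmatrix} 1 & -3 \\ 0 & 1 \end{pmatrix},$$

and it is easy to see inductively that $\mathbf{M}^n = \begin{pmatrix} 1 & -n \\ 0 & 1 \end{pmatrix}$.

Thus if the initial velocity were given by the vector $\mathbf{v} = \begin{pmatrix} 15 \\ 4 \end{pmatrix}$ we have

$$\mathbf{v}_1 = \mathbf{M}\mathbf{v} = \begin{pmatrix} 11 \\ 4 \end{pmatrix}, \quad \mathbf{v}_2 = \mathbf{M}^2\mathbf{v} = \begin{pmatrix} 7 \\ 4 \end{pmatrix}, \quad \mathbf{v}_3 = \mathbf{M}^3\mathbf{v} = \begin{pmatrix} 3 \\ 4 \end{pmatrix}, \quad \mathbf{v}_4 = \mathbf{M}^4\mathbf{v} = \begin{pmatrix} -1 \\ 4 \end{pmatrix}.$$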

The range formula gives successive values of $r$ to be:

$$r_1 = 13k; \quad r_2 = 9k; \quad r_3 = 5k; \quad r_4 = k \quad \text{and so on, where} \quad k = \frac{2v}{g\cos\alpha},$$

with $g = 9.8$ and $\cos\alpha = 2/\sqrt{5}$.


Evidently, after the third bounce, the ball starts coming down the plane. 
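Had the coefficient of restitution been smaller (say $\tfrac12$), our matrix would
have been

$$M = \begin{pmatrix} 1 & -1 \\ 0 & \tfrac12 \end{pmatrix}, \quad \text{with} \quad M^2 = \begin{pmatrix} 1 & -1\tfrac12 \\ 0 & \tfrac14 \end{pmatrix}, \quad M^3 = \begin{pmatrix} 1 & -1\tfrac34 \\ 0 & \tfrac18 \end{pmatrix}, \quad \text{and so on.}$$

In this case, with initial velocity the same, we obtain

$$\mathbf{v}_1 = \begin{pmatrix} 1 & -1 \\ 0 & \tfrac12 \end{pmatrix}\begin{pmatrix} 15 \\ 4 \end{pmatrix} = \begin{pmatrix} 11 \\ 2 \end{pmatrix}, \qquad \mathbf{v}_2 = \begin{pmatrix} 1 & -1\tfrac12 \\ 0 & \tfrac14 \end{pmatrix}\begin{pmatrix} 15 \\ 4 \end{pmatrix} = \begin{pmatrix} 9 \\ 1 \end{pmatrix},$$

$$\mathbf{v}_3 = \begin{pmatrix} 1 & -1\tfrac34 \\ 0 & \tfrac18 \end{pmatrix}\begin{pmatrix} 15 \\ 4 \end{pmatrix} = \begin{pmatrix} 8 \\ \tfrac12 \end{pmatrix}, \qquad \mathbf{v}_4 = \begin{pmatrix} 1 & -1\tfrac78 \\ 0 & \tfrac1{16} \end{pmatrix}\begin{pmatrix} 15 \\ 4 \end{pmatrix} = \begin{pmatrix} 7\tfrac12 \\ \tfrac14 \end{pmatrix},$$

and successive values of $r$:

$$r_1 = 13k; \quad r_2 = 10k; \quad r_3 = 8\tfrac12 k; \quad \ldots$$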


Here we may well ask whether the value of r ever becomes negative — 
whether the ball ever leaps down the plane — before bouncing ceases for 
ever. 
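
The repeated multiplication by $M$ in this example can be tried out numerically. The following is only a minimal sketch, assuming the bounce matrix has first row $(1, -2\tan\alpha)$ and second row $(0, e)$ as in the worked figures above; the function names `bounce_matrix`, `apply` and `velocities_after_bounces` are illustrative and not part of the text. Exact fractions are used so that the results can be compared directly with the hand calculation.

```python
from fractions import Fraction


def bounce_matrix(tan_alpha, e):
    """Matrix M taking the velocity components (u, v) at the start of one
    trajectory to the components just after the next bounce."""
    return [[Fraction(1), -2 * Fraction(tan_alpha)],
            [Fraction(0), Fraction(e)]]


def apply(matrix, vector):
    """Multiply a 2 x 2 matrix by a column vector given as a tuple."""
    return tuple(sum(m * x for m, x in zip(row, vector)) for row in matrix)


def velocities_after_bounces(tan_alpha, e, u, v, bounces):
    """Return the list [Mv, M^2 v, ..., M^n v] for n = bounces."""
    m = bounce_matrix(tan_alpha, e)
    state = (Fraction(u), Fraction(v))
    history = []
    for _ in range(bounces):
        state = apply(m, state)
        history.append(state)
    return history


# The two cases of the text: tan(alpha) = 1/2, initial velocity (15, 4).
elastic = velocities_after_bounces(Fraction(1, 2), 1, 15, 4, 4)
half = velocities_after_bounces(Fraction(1, 2), Fraction(1, 2), 15, 4, 4)
print(elastic)
print(half)
```

Increasing the last argument gives as many further bounces as desired, which is one way of exploring the question just raised.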


Example 2 


Collision of elastic particles moving in a line 


We consider next another situation in mechanics, that of a number of 
billiard balls moving along a straight line, with impacts occurring between 
pairs of adjacent balls. We shall suppose that the balls are all of equal 
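mass $m$, and that the coefficient of restitution between any pair is $e$. Suppose
the two balls have speeds $u_1$, $u_2$ before impact and $v_1$, $v_2$ after impact.

Then, by conservation of momentum:

$$mv_1 + mv_2 = mu_1 + mu_2$$

and by Newton's law of restitution:

$$v_2 - v_1 = -e(u_2 - u_1).$$

Thus

$$\begin{aligned} v_1 + v_2 &= u_1 + u_2 \\ v_1 - v_2 &= -eu_1 + eu_2 \end{aligned}
\qquad\text{or}\qquad
\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}\begin{pmatrix} v_1 \\ v_2 \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ -e & e \end{pmatrix}\begin{pmatrix} u_1 \\ u_2 \end{pmatrix}.$$

Pre-multiplying by the inverse of $\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}$, namely $\tfrac12\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}$,

$$\begin{pmatrix} v_1 \\ v_2 \end{pmatrix} = \begin{pmatrix} \tfrac12(1 - e) & \tfrac12(1 + e) \\ \tfrac12(1 + e) & \tfrac12(1 - e) \end{pmatrix}\begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} p & q \\ q & p \end{pmatrix}\begin{pmatrix} u_1 \\ u_2 \end{pmatrix},$$

where $p = \tfrac12(1 - e)$, $q = \tfrac12(1 + e)$, and $p + q = 1$, or more briefly as
$\mathbf{v} = M\mathbf{u}$, where $\mathbf{v}$ is the column vector giving the speeds after impact, $\mathbf{u}$ is
the column vector giving the speeds before impact, and $M$ is the matrix
$\begin{pmatrix} p & q \\ q & p \end{pmatrix}$ which converts velocities before into velocities after impact.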


(Note that $\mathbf{v}$ is not a vector in the same sense that $\mathbf{v}$ was in Example 1.
There, the two constituents of the column vector were the components of $\mathbf{v}$,
along and normal to the plane, so that $\mathbf{v}$ was an 'ordinary' vector, namely
a velocity. Here, $\mathbf{v}$ is simply a 'list' of the (signed) speeds of the two balls in
the system. If there had been more than two balls, then a column with more
than two components would have been necessary.)
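
The relation between speeds before and after an impact can be checked numerically. The sketch below is an illustration only: it assumes the collision matrix has rows $(p, q)$ and $(q, p)$ with $p = \tfrac12(1 - e)$ and $q = \tfrac12(1 + e)$, and the names `collision_matrix`, `mat_vec` and `mat_mul`, together with the sample speeds, are hypothetical choices rather than anything taken from the text.

```python
from fractions import Fraction


def collision_matrix(e):
    """2 x 2 matrix converting speeds before impact into speeds after impact
    for two balls of equal mass with coefficient of restitution e."""
    e = Fraction(e)
    p = (1 - e) / 2
    q = (1 + e) / 2
    return [[p, q], [q, p]]


def mat_vec(m, x):
    """Matrix times column vector."""
    return tuple(sum(a * b for a, b in zip(row, x)) for row in m)


def mat_mul(a, b):
    """Product of two square matrices of the same order."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


e = Fraction(1, 2)
m = collision_matrix(e)
u = (Fraction(3), Fraction(-1))      # hypothetical signed speeds before impact
v = mat_vec(m, u)                    # signed speeds after impact

momentum_conserved = sum(v) == sum(u)
restitution_holds = (v[1] - v[0]) == -e * (u[1] - u[0])
m_squared = mat_mul(m, m)            # effect of two successive impacts
print(v, momentum_conserved, restitution_holds, m_squared)
```

With $e = 1$ the same function returns the matrix that simply exchanges the two speeds.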

Now the two balls, having collided, will not do so again. But one could 
contrive to make them do so, for example, by arranging for them to 
rebound at right angles from two cushions as in the figure, further impacts
occurring in the line of motion. Or the system may consist of tiny elastic
particles inside a smooth circular tube, so that after colliding they rebound
and collide repeatedly by moving round the circumference of the tube.
The result of repeated collisions would be determined by computing
successive powers of the matrix $M$: $M^2$, $M^3$, and so on.


Example 3


Repeated impacts of three balls moving in a line 


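Suppose now we have three balls, A, B, and C of equal mass moving in a
straight line, and that A first strikes B, after which B strikes C. We shall
suppose the speeds to be $u_1$, $v_1$, $w_1$ initially, $u_2$, $v_2$, $w_2$ after the first impact
between A and B, and finally $u_3$, $v_3$, $w_3$ after the second impact. We have,
as in the previous section:

$$\begin{aligned} u_2 &= \tfrac12(1 - e)u_1 + \tfrac12(1 + e)v_1 = pu_1 + qv_1 \\ v_2 &= \tfrac12(1 + e)u_1 + \tfrac12(1 - e)v_1 = qu_1 + pv_1 \\ w_2 &= w_1 \end{aligned}
\quad\Rightarrow\quad
\begin{pmatrix} u_2 \\ v_2 \\ w_2 \end{pmatrix} = \begin{pmatrix} p & q & 0 \\ q & p & 0 \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} u_1 \\ v_1 \\ w_1 \end{pmatrix}.$$

For the impact between B and C we have similarly,

$$\begin{aligned} u_3 &= u_2 \\ v_3 &= \tfrac12(1 - e)v_2 + \tfrac12(1 + e)w_2 = pv_2 + qw_2 \\ w_3 &= \tfrac12(1 + e)v_2 + \tfrac12(1 - e)w_2 = qv_2 + pw_2 \end{aligned}
\quad\Rightarrow\quad
\begin{pmatrix} u_3 \\ v_3 \\ w_3 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & p & q \\ 0 & q & p \end{pmatrix}\begin{pmatrix} u_2 \\ v_2 \\ w_2 \end{pmatrix}.$$

Abbreviating the notation:

$$\mathbf{v}_2 = M_1\mathbf{v}_1, \qquad \mathbf{v}_3 = M_2\mathbf{v}_2,$$

and these when combined give the final velocities in terms of the initial
ones: $\mathbf{v}_3 = M_2M_1\mathbf{v}_1$, where the product matrix $M_2M_1$ will be found to be

$$\begin{pmatrix} p & q & 0 \\ pq & p^2 & q \\ q^2 & pq & p \end{pmatrix}.$$

This matrix describes the final result after the two successive collisions. As
expected, $M_2M_1 \ne M_1M_2$, so the final result would be different if the
collision between B and C had preceded that between A and B.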
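
The product of the two collision matrices, and the fact that the order matters, can be confirmed by direct multiplication. This is a sketch under the assumption that the A-B and B-C impact matrices have the block forms described above; the names `three_ball_matrices` and `mat_mul`, and the choice $e = \tfrac12$, are illustrative only.

```python
from fractions import Fraction


def mat_mul(a, b):
    """Product of two square matrices of the same order."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def three_ball_matrices(e):
    """Return p, q and the matrices for an A-B impact and a B-C impact."""
    e = Fraction(e)
    p = (1 - e) / 2
    q = (1 + e) / 2
    m1 = [[p, q, 0], [q, p, 0], [0, 0, 1]]   # A strikes B, C unaffected
    m2 = [[1, 0, 0], [0, p, q], [0, q, p]]   # B strikes C, A unaffected
    return p, q, m1, m2


p, q, m1, m2 = three_ball_matrices(Fraction(1, 2))
ab_then_bc = mat_mul(m2, m1)   # first A-B, then B-C
bc_then_ab = mat_mul(m1, m2)   # the opposite order
print(ab_then_bc)
print(bc_then_ab)
```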


Example 4 


Perfectly elastic collisions 


The case $e = 1$ deserves special attention. Here we have

$$M_1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad M_2 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix},$$

and these are permutation matrices, confirming the fact that the pairs of
colliding balls exchange velocities upon impact. The diagram

$$\begin{array}{ccc} A & B & C \\ u_1 & v_1 & w_1 \\ v_1 & u_1 & w_1 \\ v_1 & w_1 & u_1 \\ w_1 & v_1 & u_1 \end{array}$$

shows the velocities after successive collisions between A and B, B and C,
and a further collision between A and B, in the case when $u_1 > v_1 > w_1$.
Evidently no further collisions can occur because after the third impact
both pairs of adjacent balls are separating, with $u_1 > v_1$ and $v_1 > w_1$.
It is interesting to extend to the case when there are 4 (or more) perfectly
elastic balls moving in a line. The matrices will then be

$$\begin{pmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \text{ for an impact between A and B;}$$

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \text{ for an impact between B and C;}$$

$$\text{and } \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{pmatrix} \text{ for an impact between C and D.}$$


The order in which impacts occur will depend upon the initial position
and initial velocities of the 4 balls. If 6 impacts occur in the sequence
AB, BC, CD, BC, AB, BC, the velocities follow the sequence:

$$\begin{array}{cccc} A & B & C & D \\ u & v & w & z \\ v & u & w & z \\ v & w & u & z \\ v & w & z & u \\ v & z & w & u \\ z & v & w & u \\ z & w & v & u \end{array}$$

the final velocities $z$, $w$, $v$, $u$ being the original ones in the reverse order. This
is shown graphically in the figure where a displacement time graph is
given for each ball. At each impact, the two bodies 'swop graphs'. Clearly,
since four lines intersect in at most six points, there can be no more than
six impacts under the most favourable conditions, with $u > v > w > z$.
This may obviously be extended to the case of five balls moving in a line,
when the maximum number of possible collisions will be 10 ($= {}^5C_2$).

[Figure: displacement-time graphs for the balls A, B, C, D, with the 1st to 6th impacts marked along the time axis $t$.]
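
The sequence of exchanges for four perfectly elastic balls can be followed step by step with permutation matrices. The sketch below assumes each impact is represented by the identity matrix with two adjacent rows interchanged; the function names and the sample velocities 4, 3, 2, 1 (standing for $u > v > w > z$) are hypothetical.

```python
from math import comb


def swap_matrix(n, i):
    """n x n permutation matrix exchanging entries i and i + 1 (0-based)."""
    m = [[1 if r == c else 0 for c in range(n)] for r in range(n)]
    m[i], m[i + 1] = m[i + 1], m[i]
    return m


def mat_vec(m, x):
    """Matrix times column vector, returned as a list."""
    return [sum(a * b for a, b in zip(row, x)) for row in m]


AB, BC, CD = 0, 1, 2                      # which adjacent pair collides
impacts = [AB, BC, CD, BC, AB, BC]        # the sequence considered in the text

velocities = [4, 3, 2, 1]                 # hypothetical values with u > v > w > z
history = [velocities]
for pair in impacts:
    velocities = mat_vec(swap_matrix(4, pair), velocities)
    history.append(velocities)

# Greatest possible number of impacts for 3, 4 and 5 balls.
max_impacts = {n: comb(n, 2) for n in (3, 4, 5)}
print(history)
print(max_impacts)
```

Each row of `history` corresponds to one row of the velocity table, and the last row is the first one reversed.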


Example 5 


As a final example, which may be answered by the use of matrices and
which would be very difficult by any other means, suppose one considers
the following problem.

A body is first given a rotation through 180° about an axis, and then a
quarter-turn (through 90°) about a second axis, the two axes intersecting
at a point $O$ and making an angle $\theta$ with each other. It seems a reasonable
assumption that these successive rotations may be replaced by a single
rotation about some other axis through $O$. We now ask whether it is
possible for the angle $\theta$ between the axes of rotation to be chosen in such a
way that the single rotation will be through ±120°. In other words, can a
half-turn, followed by a quarter-turn, ever lead to a one-third turn?

There are probably not very many people who can visualise the solution 
of such a problem, as it would require much more three-dimensional 
insight than any but the most gifted possess. Even if one were allowed to 
use apparatus — rods, matches, pieces of string, cardboard, etc. — one 
would still be hard put to it to obtain a solution. The reader, trusting his
intuition, may well wonder whether the problem is soluble, and may like
to perform experiments to discover whether his intuition has misled him
before reading on.

Using matrices, however, the solution is comparatively simple. We are
concerned with rotations about two axes intersecting at an angle $\theta$. We
therefore choose our coordinate system so that the origin is the point of
intersection of the lines, one of them is the axis of $x$, and the other lies in the
plane of the $x$- and $y$-axes, i.e., the plane $z = 0$, so that it has equations

$$z = 0, \quad y = x\tan\theta.$$

[Figure: coordinate axes showing the axis of the first rotation and the axis of the second rotation.]


Our first rotation, through 180°, will be about this axis, and in order to set
up the matrix, we must enquire the destinations of the points $U$, $V$, $W$
under this half-turn. Clearly the point $W'$ has coordinates $(0, 0, -1)$, so
the third column of the matrix is determined.

A 'plan' view on to the $xOy$ plane shows that the half-turn takes the point $U$
to $U'$ with coordinates $(\cos 2\theta, \sin 2\theta, 0)$, while $V'$ (remembering that
$\angle U'OV' = 90°$) will be found to have coordinates $(\sin 2\theta, -\cos 2\theta, 0)$.†

† Compare the matrix for the two-dimensional reflection in a line through the origin; see 13.2.

Thus the matrix representing the half-turn is

$$A = \begin{pmatrix} \cos 2\theta & \sin 2\theta & 0 \\ \sin 2\theta & -\cos 2\theta & 0 \\ 0 & 0 & -1 \end{pmatrix}.$$


Since our second rotation is about the $x$-axis, it will be easier to find the
matrix, but we must specify in which sense the quarter-turn is to be carried
out (this did not matter in the case of the half-turn, of course). If the sense
of the rotation is that of a right-hand screw in the positive direction of the
$x$-axis, i.e., such as to carry the point $V$ into coincidence with the
point $W$, then we can easily see that $U'$ is $(1, 0, 0)$, this point being invariant
under the rotation, $V'$ is $(0, 0, 1)$, and $W'$ is $(0, -1, 0)$. Thus our second
matrix is

$$B = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}.$$


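The combined effect of the two rotations will be represented by the product
matrix $BA$ (first $A$, then $B$, since the matrices are pre-multiplying column
vectors), and we have

$$BA = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}\begin{pmatrix} \cos 2\theta & \sin 2\theta & 0 \\ \sin 2\theta & -\cos 2\theta & 0 \\ 0 & 0 & -1 \end{pmatrix} = \begin{pmatrix} \cos 2\theta & \sin 2\theta & 0 \\ 0 & 0 & 1 \\ \sin 2\theta & -\cos 2\theta & 0 \end{pmatrix} = C.$$

The question is: can $C$ conceivably represent a third-turn, i.e., a rotation
through ±120°? In order to answer this, we first consider the matrix $C^2$
which should represent a rotation through ±240°. Writing $c = \cos 2\theta$,
$s = \sin 2\theta$,

$$C^2 = \begin{pmatrix} c & s & 0 \\ 0 & 0 & 1 \\ s & -c & 0 \end{pmatrix}\begin{pmatrix} c & s & 0 \\ 0 & 0 & 1 \\ s & -c & 0 \end{pmatrix} = \begin{pmatrix} c^2 & cs & s \\ s & -c & 0 \\ cs & s^2 & -c \end{pmatrix}$$

and

$$C^3 = \begin{pmatrix} c^2 & cs & s \\ s & -c & 0 \\ cs & s^2 & -c \end{pmatrix}\begin{pmatrix} c & s & 0 \\ 0 & 0 & 1 \\ s & -c & 0 \end{pmatrix} = \begin{pmatrix} c^3 + s^2 & c^2s - cs & cs \\ cs & s^2 & -c \\ c^2s - cs & cs^2 + c^2 & s^2 \end{pmatrix}.$$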


Now if $C$ is to represent a 120° rotation, $C^3$ must be the identity matrix, and
the equation $C^3 = I$ would mean that the nine equations:

$$\begin{array}{lll} c^3 + s^2 = 1 \quad (i) & c^2s - cs = 0 \quad (ii) & cs = 0 \quad (iii) \\ cs = 0 \quad (iv) & s^2 = 1 \quad (v) & -c = 0 \quad (vi) \\ c^2s - cs = 0 \quad (vii) & cs^2 + c^2 = 0 \quad (viii) & s^2 = 1 \quad (ix) \end{array}$$

ought all to be satisfied. It may well be that there does not exist an angle $\theta$
which satisfies all these equations. However, our fears on this score are soon
dispelled. The key to the solution is equation $(vi)$, requiring that $c = 0$,
in which case $(ii)$, $(iii)$, $(iv)$, $(vii)$ and $(viii)$ are automatically true. The
remaining three equations $(i)$, $(v)$, and $(ix)$ then all reduce to $s^2 = 1$, which
is consistent with $c = 0$, and we summarise our findings:

$$\cos 2\theta = 0; \quad \sin 2\theta = \pm 1.$$

These are satisfied by $\theta = 45°$ (and by 135°, as well as other values of $\theta$
which need not concern us). Thus we have verified that a rotation through
180°, followed by a rotation through 90° about an axis inclined to the first at
45°, may be together equivalent to a single rotation through 120°.

We may go further and use matrices to locate the axis of this single rotation.
This is not difficult, for points on the axis of rotation are unchanged, or
invariant, under the rotation. Thus if $\mathbf{x} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$ is such a point, we have

$$C\mathbf{x} = \mathbf{x}.$$

But $C = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}$ (taking $\sin 2\theta = +1$),

$$\text{and so} \quad \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}\begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x \\ y \\ z \end{pmatrix} \quad\Rightarrow\quad \begin{pmatrix} y \\ z \\ x \end{pmatrix} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$$

and such a point therefore satisfies the equations $x = y = z$. Clearly the
axis of rotation is a line making equal angles with the coordinate axes, and
in fact passes through the point $(1, 1, 1)$. Note that the matrix $C$ is a
permutation matrix, its effect on the coordinates of any point being to
'move them round one place' (a cyclic permutation) whereby $x$ is
replaced by $y$, $y$ by $z$, and $z$ by $x$. Evidently this is geometrically a rotation
through 120°.
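
The whole argument of this example can be checked by multiplying the matrices out. The sketch below assumes the half-turn and quarter-turn matrices take the forms found above and uses the exact values $\cos 2\theta = 0$, $\sin 2\theta = \pm 1$ so that only integers appear; the helper names `half_turn`, `combined` and `cube` are illustrative.

```python
def mat_mul(a, b):
    """Product of two 3 x 3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def mat_vec(m, x):
    """Matrix times column vector, returned as a list."""
    return [sum(a * b for a, b in zip(row, x)) for row in m]


def half_turn(c, s):
    """Half-turn about the line z = 0, y = x tan(theta), where
    c = cos(2 theta) and s = sin(2 theta)."""
    return [[c, s, 0], [s, -c, 0], [0, 0, -1]]


QUARTER_TURN = [[1, 0, 0], [0, 0, -1], [0, 1, 0]]   # quarter-turn about the x-axis
IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]


def combined(c, s):
    """Matrix C = BA: first the half-turn, then the quarter-turn."""
    return mat_mul(QUARTER_TURN, half_turn(c, s))


def cube(m):
    return mat_mul(mat_mul(m, m), m)


c_45 = combined(0, 1)      # theta = 45 degrees
c_135 = combined(0, -1)    # theta = 135 degrees
print(c_45, cube(c_45) == IDENTITY)
print(mat_vec(c_45, [1, 1, 1]))   # a point on the axis is left unchanged
```

For comparison, `combined(1, 0)` (the case $\theta = 0$) does not cube to the identity.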

If your experiments with matches and string failed, you may now care to
demonstrate the geometry of the above by using a cube as shown overleaf.
The diagram should be self explanatory, and you may of course perform
the experiment with an ordinary die.

[Figure: net of a cube, with the half-turn axis and the quarter-turn axis marked.]

Exercise 13.8 


1 Find the condition on $e$ for three particles of equal masses moving in a
line as in example 3 to collide more than three times. Is there any limit to
the number of collisions that are possible when $e$ is small?


2 When a rotation through $2\pi/p$ about one axis through $O$ is followed by a
second rotation through $2\pi/q$ about another axis through $O$, the effect is
that of a single rotation through $2\pi/r$. In example 5 we saw that if $p = 2$,
$q = 4$, it is possible for $r$ to have the value 3. Use matrices to find other
possible sets of values of $p$ and $q$ for which $r$ is rational.


1 $Oxyz$ make a set of mutually perpendicular right-handed axes in three-dimensional
space. The transformation $Q_1$ is a quarter-turn about the
ray $Ox$, its sense being clockwise if one looks out from the origin along the
ray; and $Q_2$ is a quarter-turn about a ray from $(0, 0, 1)$ parallel to $Oy$,
clockwise if one looks out from $(0, 0, 1)$ along the ray. Find the images of a
general point $(x, y, z)$ under $Q_1$ and under $Q_2$, and hence find the image
of $(x, y, z)$ under $Q_2Q_1$ (i.e., $Q_1$ followed by $Q_2$). Prove that the transformation
$Q_2Q_1$ has no fixed point.

Give the equations of four planes with the property that $M_1M_2M_3M_4 = Q_2Q_1$,
where $M_i$ denotes the reflection in the $i$th plane. Explain why
$Q_2Q_1$ cannot be expressed as the product of fewer than four reflections.
(S.M.P.)


2 Square matrices $A$, $B$ of the same order have the property that $A^TA = I$,
$B^TB = I$ (where $A^T$, $B^T$ denote the transposes of $A$, $B$, and $I$ is the unit
matrix). Prove that:

(i) If $C = A^T$, then $C^TC = I$; (ii) If $D = AB$, then $D^TD = I$.
What can you deduce about the set of matrices of the given order such that
$X^TX = I$? (S.M.P.)


3 $A$ is a non-singular 3 × 3 matrix; $A^T$ denotes the transpose and $A^{-1}$ is
the inverse. Prove that:

$$(AB)^T = B^TA^T \quad \text{and that} \quad (A^T)^{-1} = (A^{-1})^T.$$

Denote by $A^*$ the matrix $(A^T)^{-1}$. Operations $e$, $r$, $s$, $t$ are defined on the
set of all non-singular 3 × 3 matrices by

$$e(A) = A, \quad r(A) = A^*, \quad s(A) = A^{-1}, \quad t(A) = A^T.$$

If $tr$ denotes the composition 'first $r$, then $t$', prove that $tr = s$, justify the
other entries in the composition table, and complete the table.

[Table: partly completed composition table for $e$, $r$, $s$, $t$.] (M.E.I.)


4 (A statistical application of the transpose of a matrix.)

Suppose we have two variables $x$, $y$, and a number of observed pairs of
values have been recorded: $(x_i, y_i)$, $i = 1, 2, \ldots, n$. It is supposed theoretically
that the points ought to lie on a straight line, whose equation we will
take to be $y = mx + c$. We cannot of course satisfy all the $n$ equations,

$$y_i = mx_i + c, \quad \text{or} \quad \begin{pmatrix} x_1 & 1 \\ x_2 & 1 \\ \vdots & \vdots \\ x_n & 1 \end{pmatrix}\begin{pmatrix} m \\ c \end{pmatrix} = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix},$$

for the two unknowns $m$ and $c$. However the best values of $m$ and $c$ according
to the 'principle of least squares' may be found by pre-multiplying by the
transpose of the $n \times 2$ matrix. This gives

$$\begin{pmatrix} x_1 & x_2 & \cdots & x_n \\ 1 & 1 & \cdots & 1 \end{pmatrix}\begin{pmatrix} x_1 & 1 \\ x_2 & 1 \\ \vdots & \vdots \\ x_n & 1 \end{pmatrix}\begin{pmatrix} m \\ c \end{pmatrix} = \begin{pmatrix} x_1 & x_2 & \cdots & x_n \\ 1 & 1 & \cdots & 1 \end{pmatrix}\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}.$$

Proceed to show that the values of $m$ and $c$ determined by this equation are
those which make the line pass through the mean centre of the $n$ points,
and have gradient $m$, where

$$m = \frac{s_{xy}}{s_x^2}$$

(see gradient of regression line of $y$ on $x$, p. 354).
Find the equation of the regression line of y on x through the five points 


[5.9 sail), τ" 3), (2s 2s (5, 6), (8, 9). 
5 Solve for x, y, z the equations: 


x+2y- z= 4 
oe Sy On ΞΞΞ. 2 
ox — Sy + 72 ΞΞ --᾿ἸῬ. 


Describe geometrically the intersection of the three planes given by these 
equations. (S.M.P. ) 


6 Write a short essay on the solution of three simultaneous linear equations
using the method of pivotal condensation. Include in your essay:

(i) at least one example to illustrate the method,

(ii) a brief description of the nature of 'ill-conditioning'. (M.E.I.)


7 What condition must be satisfied by the constants $a$, $b$, $c$, $l$, $m$, and $n$ in
order for the system of equations

$$\begin{aligned} -ny + mz &= a \\ nx - lz &= b \\ -mx + ly &= c \end{aligned}$$

to have solutions?

If this condition is satisfied, show that any solution of the equations will
then also satisfy the additional equation $ax + by + cz = 0$, and write
down the most general solution in this case.

Do any of these solutions also satisfy the equation $lx + my + nz = 0$?


8 A 4 x 4 determinant contains 24 terms. When there is a single zero 
among its 16 elements, there will be 24 — 6 = 18 non-zero terms. Find 
the greatest and the least possible number of non-zero terms when three 
zero elements are present, showing where they may be placed.


9 If $n$ is a positive integer, show that $b - c$, $c - a$, $a - b$ are factors of the
determinant

$$\begin{vmatrix} 1 & 1 & 1 \\ a & b & c \\ a^{n+2} & b^{n+2} & c^{n+2} \end{vmatrix}$$

and that the remaining factor is the sum of all products of $a$, $b$, $c$ of degree $n$
(with repetitions of $a$, $b$, $c$ included).
Obtain the corresponding result when the last row of the determinant is
replaced by $a^{-n}$, $b^{-n}$, $c^{-n}$. (O.C.)
MO α-Γ 1 1 

Ι0 Giventhat [αὖ x7 1 I/=0, 
Ι x7? +a? | 

show that $x = 1$, or $x = a$, or $x = -a/(a + 1)$ will satisfy the equation.


Are there any other roots? 


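11 Prove that the determinant $\Delta_n$ of order $n$

$$\Delta_n = \begin{vmatrix} 2\cos\theta & 1 & 0 & \cdots & 0 \\ 1 & 2\cos\theta & 1 & \cdots & 0 \\ 0 & 1 & 2\cos\theta & \ddots & \vdots \\ \vdots & & \ddots & \ddots & 1 \\ 0 & \cdots & 0 & 1 & 2\cos\theta \end{vmatrix}$$

satisfies the recurrence relation $\Delta_{n+1} = 2\Delta_n\cos\theta - \Delta_{n-1}$.

Prove that, if $\theta \ne k\pi$, where $k$ is an integer, then $\Delta_n = \dfrac{\sin (n + 1)\theta}{\sin\theta}$.

If the term in the top left-hand corner is changed to $\cos\theta$, what is the value
of the determinant? (O.C.)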
12 Expand the determinants

$$\begin{vmatrix} a & b & 0 \\ b & a & b \\ 0 & b & a \end{vmatrix}, \qquad \begin{vmatrix} a & b & 0 & 0 \\ b & a & b & 0 \\ 0 & b & a & b \\ 0 & 0 & b & a \end{vmatrix}$$

and continue with similar determinants of order 5, 6, ..., etc.


13 Show that the determinant


0 
Ι 
Ι 
Ϊ 


oO Fr - 
Oo = — 
—- — Φ © 
—_— CO O&O © 


must have the value 0, 1 or —1. 


14 


Introduction to algebraic structures


14.1 The idea of a structure 


In 9.5, we proved that, if $A$, $B$, $C$, $D$ are four points in space such that
$AB \perp CD$ and $AC \perp BD$, then $AD \perp BC$. This was done by using the
scalar product of vectors, and during the course of the proof we made use
of the distributive property of scalar product,

$$\mathbf{a}.(\mathbf{b} - \mathbf{c}) = \mathbf{a}.\mathbf{b} - \mathbf{a}.\mathbf{c},$$

whereby vectors behave like ordinary numbers in the relationship

$$a(b - c) = ab - ac.$$


The property of distributivity of scalar product over subtraction was not 
something we could take for granted, but had to be established, and it was 
essential that our dependence on the result should be acknowledged in the 
course of the proof concerning the orthocentric tetrahedron. Once dis- 
tributivity has been proved, however, we have available this powerful 
vector tool, enabling us to produce a very concise and elegant proof. 

This draws attention to the importance of what has come to be called 
structure, which (roughly speaking) refers to the various rules which the 
elements of a set — the set of three-dimensional vectors in this case — 
have to obey in order that the system may behave itself in a manner which 
avoids inconsistency or contradiction. We know of two methods of 'multiplying'
vectors: the scalar, and the vector, products.† These are good
definitions precisely because they lead to an orderly structure. One might
try to invent other ways of 'multiplying' vectors; e.g., AB × CD = the
product of the lengths AD and BC. Such a definition will lead to chaos!
For a start, if AB and A′B′ were 'equivalent' vectors (having equal length
and the same direction), the 'product' A′B′ × CD would be different
from the 'product' AB × CD, when we feel we have a right for it to be
the same. In the second place, AB × DC would be AC × BD when one
would rather expect it to give the negative of AB × CD. It is true that
AB × CD = CD × AB, i.e., our 'product' would be commutative. But
this is by no means essential, since such a useful definition as vector
product (see 11.5) is in fact not commutative. Much more important, we
should no longer be able to claim distributivity of × over + and −, for
AB × (CD + DE) = AB × CE = AE × BC; whereas AB × CD +
AB × DE = AD × BC + AE × BD, and these are by no means equal.

† Note the structure of grammar here: 'the scalar and the vector products' is short for 'the
scalar product and the vector product', so that the word 'products' distributes over 'and' in
this case. Again, in 'W. D. and H. O. Wills', the surname distributes over the conjunction
'and'; so too in 'Mr and Mrs Wills'.

The simpler features of structure such as commutativity and associativity
may appear to be of little interest: the smallest child can quickly learn
to appreciate that 8 + 3 = 3 + 8 (i.e., that addition is commutative), that
2 × (3 × 5) = (2 × 3) × 5 (i.e., that multiplication is associative) and
so on, and the realisation of these fundamental laws quickly becomes a
matter of habit. Even so, the question of commutativity is not trivial and
loses its aura of inevitability when one considers non-commutative operations,
such as subtraction and division of numbers and the multiplication
of matrices. In the case of subtraction, the small child may become confused
if asked 'By how much does 253 fall short of 527?' and commonly
enquires about which number to put at the top.

The idea of distributivity, involving both the operations of + and ×, is a
little more complicated. A young child may appreciate it when dealing with
a problem like, 'How much profit is made by a man who buys 50 articles
at 28p each and sells them at 40p each?':

Profit = 50 × 40 − 50 × 28 = 50(40 − 28) = 50 × 12 = 600p = £6,

and repeated practice in this sort of situation eventually imprints on his
mind the idea of distributivity of multiplication over subtraction as being
normal and inevitable. It is possible that he may in some other situation
make a false assumption about distributivity, as for example if he says
$(ab)c = ac \times bc$ (making the assumption that multiplication distributes
over multiplication); or that $(a + b)^2 = a^2 + b^2$, which wrongly presumes
that 'squaring' distributes over addition. Indeed a very common error is
what may humorously be called the 'universal law of distributivity',
$c \circ (a + b) = c \circ a + c \circ b$, where $\circ$ is some operation; e.g., 'exponentiation':
$2^{x+y} = 2^x + 2^y$ (!) or 'sine': $\sin (x + y) = \sin x + \sin y$ (!), heresies
not unknown in school work. One of the virtues of studying structure is,
hopefully, that this type of 'structural' error is less likely to be perpetrated.
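
A single numerical counter-example is enough to dispose of each of these false laws. The sketch below is an illustration only: the helper name `distributes_over_addition` and the trial values 2 and 3 are arbitrary choices, and a test that passes for one pair of numbers would of course not prove a law in general.

```python
import math


def distributes_over_addition(f, a, b):
    """Test whether f(a + b) equals f(a) + f(b) for one particular pair a, b."""
    return math.isclose(f(a + b), f(a) + f(b))


a, b = 2.0, 3.0
checks = {
    "multiply by 50": distributes_over_addition(lambda x: 50 * x, a, b),
    "square": distributes_over_addition(lambda x: x ** 2, a, b),
    "2 to the power": distributes_over_addition(lambda x: 2 ** x, a, b),
    "sine": distributes_over_addition(math.sin, a, b),
}
print(checks)
```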

It is interesting to trace the structural implications in a simple arithmetical
calculation, such as the multiplication 24 × 3. For the formal layout

$$\begin{array}{r} 24 \\ \times\ 3 \\ \hline 72 \end{array}$$

is essentially an abbreviation of

$$\begin{aligned} 24 \times 3 &= (2 \times 10 + 4) \times 3 \\ &= (2 \times 10) \times 3 + 4 \times 3 \\ &= (2 \times 10) \times 3 + 12 \\ &= (2 \times 10) \times 3 + (1 \times 10 + 2) \\ &= 2 \times (10 \times 3) + (1 \times 10 + 2) \\ &= 2 \times (3 \times 10) + (1 \times 10 + 2) \\ &= (2 \times 3) \times 10 + (1 \times 10 + 2) \\ &= 6 \times 10 + (1 \times 10 + 2) \\ &= (6 \times 10 + 1 \times 10) + 2 \\ &= (6 + 1) \times 10 + 2 \\ &= 7 \times 10 + 2 \\ &= 72. \end{aligned}$$

You should check in this analysis the stages at which you have used the
associative, commutative and distributive laws of addition and multiplication,
and then be grateful that we do not usually need to take such a
scrupulous view of elementary arithmetic!

Instead of building up our idea of structure from primitive beginnings,
let us now approach it from the starting point of a very highly organised
structure: the field. The most familiar example is perhaps the set of
rational numbers, $\mathbb{Q}$, with the operations† of addition (+) and multiplication
(×), obeying the following basic requirements, which constitute the
axioms for a field (where it is to be understood that $x$, $y$, $z$ denote any choice
of elements from our set).

$$\begin{array}{lll}
\text{A1} & (x + y) + z = x + (y + z) & \text{Associative law of addition} \\
\text{A2} & x + y = y + x & \text{Commutative law of addition} \\
\text{A3} & x + 0 = x = 0 + x & \text{Existence of identity element (zero) for addition} \\
\text{A4} & x + (-x) = 0 & \text{Existence of inverse under addition (the 'negative' of } x) \\
\text{M1} & (x \times y) \times z = x \times (y \times z) & \text{Associative law of multiplication} \\
\text{M2} & x \times y = y \times x & \text{Commutative law of multiplication} \\
\text{M3} & x \times 1 = x = 1 \times x & \text{Existence of identity element (unit) for multiplication} \\
\text{M4} & x \times x^{-1} = 1 \quad (x \ne 0) & \text{Existence of inverse under multiplication (the 'reciprocal' of } x) \\
\text{D} & x \times (y + z) = x \times y + x \times z & \text{Distributive law}
\end{array}$$

† i.e., closed operations (see p. 478).

It will be seen that the first four rules (A1–A4) are concerned with addition,
M1–M4 with multiplication; D is the distributive law of multiplication over
addition (to which we have chiefly referred) and which relates addition
with multiplication. These laws govern the behaviour of the rationals
under the ordinary manipulations of algebra: subtraction is taken care of
by the application of A1, A2, A3 and A4. For if we ask 'What must be
added to $x$ to obtain $y$?', we can easily show that

$$\begin{aligned} x + z = y &\Rightarrow -x + (x + z) = -x + y \\ &\Rightarrow (-x + x) + z = -x + y \\ &\Rightarrow 0 + z = -x + y \\ &\Rightarrow z = -x + y = y + (-x). \end{aligned}$$

In the same way, if $x \times z = y$, we may show that $z = x^{-1}y = yx^{-1}$,
and this is a formal way of writing the division $y \div x$. We may loosely
describe a field as a number system in which addition, subtraction, multiplication
and division may all be carried out, with the exception of division
by zero. Of course, it may be that some of the laws for a field as given
above may be redundant: it is possible, for instance, to dispense with the
second half of A3 and derive it from the first half and A2. When there are no
such redundancies, we should have what may be called a minimum set of
axioms.

The above was chosen as an example, first because the field is one of
the fullest and most satisfying structures, and also because the rationals are
so familiar. You will readily check that the reals $\mathbb{R}$ form a larger field (of
which the rationals $\mathbb{Q}$ are a sub-field), and the complex numbers $\mathbb{C}$ again
form a field which contains the reals as a sub-field. Do not imagine, however,
that fields are necessarily infinite: it is common to have finite fields,
and you may care to show that the two numbers {0, 1} form a field when
addition and multiplication are defined, modulo 2, thus:

$$\begin{array}{c|cc} + & 0 & 1 \\ \hline 0 & 0 & 1 \\ 1 & 1 & 0 \end{array} \qquad\qquad \begin{array}{c|cc} \times & 0 & 1 \\ \hline 0 & 0 & 0 \\ 1 & 0 & 1 \end{array}$$
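
For a finite set the field axioms can be verified exhaustively. The following is a minimal sketch, assuming the set is {0, 1, ..., n − 1} with addition and multiplication modulo $n$; the function name `is_field_mod` is hypothetical, and the brute-force approach is merely one convenient way of checking the axioms A1 to A4, M1 to M4 and D.

```python
from itertools import product


def is_field_mod(n):
    """Brute-force check of the field axioms for {0, ..., n-1} modulo n."""
    if n < 2:                      # the identities 0 and 1 must be distinct
        return False
    elements = range(n)

    def add(x, y):
        return (x + y) % n

    def mul(x, y):
        return (x * y) % n

    for x, y, z in product(elements, repeat=3):
        if add(add(x, y), z) != add(x, add(y, z)):            # A1
            return False
        if mul(mul(x, y), z) != mul(x, mul(y, z)):            # M1
            return False
        if mul(x, add(y, z)) != add(mul(x, y), mul(x, z)):    # D
            return False
    for x, y in product(elements, repeat=2):
        if add(x, y) != add(y, x) or mul(x, y) != mul(y, x):  # A2, M2
            return False
    for x in elements:
        if add(x, 0) != x or mul(x, 1) != x:                  # A3, M3
            return False
        if not any(add(x, y) == 0 for y in elements):         # A4
            return False
        if x != 0 and not any(mul(x, y) == 1 for y in elements):  # M4
            return False
    return True


results = {n: is_field_mod(n) for n in (2, 5, 6)}
print(results)
```

When the check fails for a particular modulus, it is instructive to find which element has no reciprocal.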


Clearly, however, the integers $\mathbb{Z}$ do not form a field, since multiplicative
inverses do not exist for numbers other than +1 or −1. So M4 is not satisfied
and the integers with the operations + and × form a 'lower' structure,
known as a ring. By contrast, there do exist even more highly organised
structures than fields, e.g., vector spaces and modules, but these need not
concern us here, and we shall concentrate upon the most important of all
mathematical structures, known as a group. So central is this that its study
forms the backbone of the present chapter, but first we must discuss more
fully the idea of a binary operation.


Exercise 14.1
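1 Consider the long multiplication

$$\begin{array}{r} 275 \\ \times\ 79 \\ \hline 2475 \\ 19250 \\ \hline 21725 \end{array}$$

and analyse the uses of the field axioms in the course of the work in the
same way as was done in the case 24 × 3 = 72 above.


2 Revise the proofs of $\mathbf{a}.(\mathbf{b} + \mathbf{c}) = \mathbf{a}.\mathbf{b} + \mathbf{a}.\mathbf{c}$
and $\mathbf{a} \times (\mathbf{b} + \mathbf{c}) = \mathbf{a} \times \mathbf{b} + \mathbf{a} \times \mathbf{c}$,
where $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ are vectors in three dimensions.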


3 Show that {0, 1, 2, 3, 4} is a field under addition and multiplication 
modulo 5, zero being excluded in the case of multiplication, but that 
{0, 1, 2, 3, 4, 5} cannot be made into a field with addition and multiplica- 
tion modulo 6, even when zero is excluded. 


14.2 Binary operations 


In the set of axioms for a field given in 14.1, we were dealing with a familiar 
example in which x, y, z represented numbers. One of the deficiencies of 
traditional school mathematics was the impression given that letters always 
stood for numbers. But here you have already encountered many examples 
of the use of letters for other entities — sets, matrices, rotations, points, 
vectors, permutations, lines, curves, ordered pairs, and so on. We may 
now ask whether sets of such objects may be organised into structures. 

That the answer to this question must be ‘yes’ you cannot have doubted. 
But whatever the nature of the elements of the set which is being provided 
with a structure, the fundamental step to such provision is the idea of a
binary operation or law of composition, by which we can take any two elements
of the set and combine them in some way, according to some rule. Thus, for
example, multiplication is a binary operation on the set of integers, for
when one takes two integers such as −3 and 2 and combines them by this
particular binary operation, one obtains the integer −6, which is also in $\mathbb{Z}$,
and we write (−3) × (+2) = −6, the binary operation of multiplication
being represented by the familiar symbol ×. More generally, if the binary
operation is represented by ∗, we may describe the result of combining
members $x$ and $y$ of the background set $S$ formally as follows:

$$z = x * y$$

and this states that the result of applying the binary operation ∗ to the two
elements $x$ and $y$ of $S$ is to produce the element $z$. If (as in the multiplication
of integers) the result of an operation is always in the original set $S$, then
we say that our operation is closed; but if there is some $z$ which is not in $S$,
then the operation is not closed, as for example when $S = \mathbb{Z}$ (the integers) and ∗ is
the operation of division. Then (−3) ∗ (+2) = −1½. But −1½ is not in
the set $\mathbb{Z}$, so the operation ÷ on the integers is not closed. Again, suppose **
denotes 'exponentiation' (as in the computer programming language
Fortran), then (−3) ** (+2) = $(-3)^2$ = +9. However, (+2) ** (−3) =
$2^{-3} = \tfrac18$, and this operation would be closed if our set had been the reals $\mathbb{R}$,
but is not closed when we confine our attention to the integers $\mathbb{Z}$. (Note, in
this case, that we must be careful to exclude the case $x = y = 0$, for no
value can be assigned to $0^0$.)

One may also describe a closed binary operation on a set $S$ as a mapping
from ordered pairs of the set into members of the set itself: $(x, y) \mapsto z$
where $x, y, z \in S$. If however, the binary operation is not closed, the image
may be outside the set, as for example in the case of the scalar product operation . on the
set of vectors, which is a mapping from the ordered pairs of vectors into the
reals, $\mathbb{R}$.

It is important to realise that by the use of a binary operation, an element
may be combined with itself, and that $x * x$ must be defined for all $x$ in $S$:
e.g., if $S = \mathbb{R}$ and ∗ is multiplication, then $x * x = x^2$; again, if $S = \mathbb{Z}^+$
and ∗ is the operation of finding the h.c.f. of $x$ and $y$, then in the case
$x = y$, we have $x * x = x$ for all $x$ in $\mathbb{Z}^+$ (since the h.c.f. of a positive
integer and itself is that same number). Moreover, if $z = x * y$, it is perfectly
possible for $z$ to be equal to $x$ or $y$. For, if ** is exponentiation,
3 ** 1 = 3, and 1 ** 3 = 1.
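
Closure in the integers can be tested on a small sample of values. The sketch below is only an illustration under stated assumptions: the names `is_integer` and `closed_on_sample` and the sample −3, ..., 3 are hypothetical, and a sample can show that an operation is not closed but can never prove that it is. Exact fractions are used so that a result such as −1½ is not disguised by rounding.

```python
from fractions import Fraction
from math import gcd


def is_integer(value):
    """True when an exact rational value is a whole number."""
    return Fraction(value).denominator == 1


def closed_on_sample(op, sample, defined=lambda x, y: True):
    """Check that op(x, y) is an integer for every ordered pair in the sample
    on which the operation is defined."""
    return all(
        is_integer(op(Fraction(x), y))
        for x in sample
        for y in sample
        if defined(x, y)
    )


sample = range(-3, 4)

mult_closed = closed_on_sample(lambda x, y: x * y, sample)
div_closed = closed_on_sample(lambda x, y: x / y, sample,
                              defined=lambda x, y: y != 0)
# A zero base with a zero or negative exponent is excluded, as in the text.
pow_closed = closed_on_sample(lambda x, y: x ** y, sample,
                              defined=lambda x, y: not (x == 0 and y <= 0))

print(mult_closed, div_closed, pow_closed)
print(Fraction(-3) / 2, Fraction(2) ** -3, gcd(12, 12))
```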


Linear transformations 


We now consider an example where the elements of the set are not
numbers but two-dimensional linear transformations, such as rotations,
translations, reflections, shears, or enlargements. Given two elements in
the set of all such transformations, how do we define a binary operation ∘ on
them? Simply by considering the composition of the movements, one after
the other. So if $T_1$ and $T_2$ are two transformations, we shall denote the
result of first performing the transformation $T_1$ and then performing $T_2$
by $T_2 \circ T_1$, the reason for this order being our similar practice when
linear transformations were represented by matrices (see 13.2). Suppose
now that $R_1$ represents a +90° turn about $O_1$, and $R_2$ represents a +60°
turn about point $O_2$ in the plane of the rotations. Then in the case $O_1 = O_2$,
$R_2 \circ R_1$ evidently represents a +150° turn; and also, clearly $R_2 \circ R_1 = R_1 \circ R_2$.
In the case when $O_1$ and $O_2$ are distinct, intuition tells us that
$R_2 \circ R_1$ is a +150° turn, but it is not immediately clear about which point
the rotation is centred. A little experiment should however convince you
that $R_2 \circ R_1 \ne R_1 \circ R_2$.

Again, suppose our set of transformations consists of all possible rotations 
in the plane, through any possible angle about all possible points. In view
of the above informal discussion, it may seem that this set is closed under the
operation $\circ$. However, this is not so, and it is not difficult to produce a
counter-example: consider a positive 90° turn about A followed by a
negative 90° turn about a different point B:


It is clear from the figure that the composition of the two rotations is not a
rotation, but a translation T, for after rotation through two equal and
opposite angles, the orientation of the figure is restored. This is the sort of
manoeuvre that might be executed in the course of a dance, at the end of which
the dancer would be facing the same way. Evidently, the composition of
rotations about different points through equal and opposite angles will
always be a translation, so that the set of plane rotations is not closed under
composition. Closure may be achieved, however, by adding the plane
translations to give a set of transformations comprising both rotations and
translations. Notice, however, that the subset of translations alone is closed
within itself.
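The counter-example can be checked numerically. The sketch below treats points of the plane as complex numbers, which is our own modelling choice, and the centres A = 0 and B = 2 are hypothetical. If the composite moves every test point by the same amount, it is behaving as a translation.

```python
import cmath
import math


def rotation(centre, degrees):
    """Rotation of the plane (points as complex numbers) about `centre`."""
    turn = cmath.exp(1j * math.radians(degrees))
    return lambda z: centre + turn * (z - centre)


def compose(second, first):
    """second o first: perform `first`, then `second`."""
    return lambda z: second(first(z))


A, B = 0 + 0j, 2 + 0j                       # hypothetical centres
quarter_turn_A = rotation(A, +90)
back_turn_B = rotation(B, -90)
both = compose(back_turn_B, quarter_turn_A)

points = [0j, 1 + 0j, 1j, 3 - 2j]
shifts = [both(z) - z for z in points]       # equal shifts indicate a translation
print(shifts)

# Rotations about distinct centres do not commute in general.
r1, r2 = rotation(A, 90), rotation(B, 60)
print(compose(r2, r1)(0j), compose(r1, r2)(0j))
```

With these centres every point is displaced by 2 + 2i (up to rounding), while the two orders of the 90° and 60° turns send the origin to different places.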


As another illustration, consider the half-turns (180° rotations) of a 
cuboid about its three perpendicular central axes, and let us call these 
X, Y, and Z. Here is a set containing only three objects, and we consider
whether we have closure. A simple experiment with a matchbox will convince
you that $X \circ Y = Y \circ X = Z$, with four other similar relations. But
our hopes of closure are dashed when we remember that $X \circ X$ is neither
X nor Y nor Z, but has the effect of restoring the original position of the
box. This is the identity transformation, which may be called I, and it must be
included in our set if we are to have a closed system. The table below shows
the way in which our four transformations combine: 
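The way the half-turns combine can also be generated mechanically. In this sketch each half-turn is represented by the diagonal entries of a 3 × 3 matrix (an assumption of ours: a half-turn about one axis reverses the other two coordinates), so that composition is entry-by-entry multiplication.

```python
# Half-turns of a cuboid about its three central axes, stored as the diagonal
# entries of the corresponding 3 x 3 matrices (a modelling assumption).
HALF_TURNS = {
    "I": (1, 1, 1),
    "X": (1, -1, -1),    # half-turn about the x-axis
    "Y": (-1, 1, -1),
    "Z": (-1, -1, 1),
}
NAME_OF = {entries: name for name, entries in HALF_TURNS.items()}


def compose(a, b):
    """Diagonal matrices multiply entry by entry."""
    product = tuple(p * q for p, q in zip(HALF_TURNS[a], HALF_TURNS[b]))
    return NAME_OF[product]


def table(names):
    return {(a, b): compose(a, b) for a in names for b in names}


def is_closed(names):
    return set(table(names).values()) <= set(names)


def show(names):
    print("  | " + " ".join(names))
    for a in names:
        print(a + " | " + " ".join(compose(a, b) for b in names))


print(is_closed("XYZ"))     # X o X = I lies outside {X, Y, Z}
print(is_closed("IXYZ"))
show("IXYZ")
```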


Permutations 


As another illustration of a set with a binary operation, we consider the 
composition of permutations, a most important idea in mathematics. 
A finite permutation is strictly a mapping from a finite set on to itself, such
as

[Figure: arrow diagram of a mapping of the set {A, B, C, D, E} on to itself]

Here there are five elements A, B, C, D, E, and the arrows indicate that the
element A is 'mapped' into D, D is mapped into E, and E into A, while
B and C are interchanged. It is perhaps convenient to think of a permutation
as a 'rearrangement', and to represent it as

$$\begin{pmatrix} A & B & C & D & E \\ D & C & B & E & A \end{pmatrix}.$$

Calling this permutation p, suppose now we wish to combine it with another
permutation q which replaces the letters according to the scheme

$$\begin{pmatrix} A & B & C & D & E \\ C & E & B & D & A \end{pmatrix}.$$

Consider the effect of first performing the permutation p, and then following
this by the permutation q. The effect will be that A will be replaced first
by D, and finally by D; B will be replaced by C, and subsequently by B,
and so on. The final effect of the composite permutation arising from the
successive application of first p then q will be thus:

$$\begin{pmatrix} A & B & C & D & E \\ D & B & E & A & C \end{pmatrix}.$$

Calling this permutation r, and adhering to the rule of writing the first operation
on the right, we may write

$$r = q \circ p = \begin{pmatrix} A & B & C & D & E \\ D & B & E & A & C \end{pmatrix},$$

where $\circ$ represents the binary operation of composition of the permutations.
You should verify that

$$s = p \circ q = \begin{pmatrix} A & B & C & D & E \\ B & A & C & E & D \end{pmatrix},$$

showing that composition of permutations is not commutative.
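The verification asked for above is easily mechanised. In this sketch a permutation is stored as a dictionary from each letter to its replacement; p is taken from the description (A to D, D to E, E to A, B and C interchanged) and q from the scheme with bottom row C E B D A. The function names are our own.

```python
LETTERS = "ABCDE"
p = dict(zip(LETTERS, "DCBEA"))    # A->D, B->C, C->B, D->E, E->A
q = dict(zip(LETTERS, "CEBDA"))    # A->C, B->E, C->B, D->D, E->A


def compose(second, first):
    """second o first: apply `first`, then `second` (first operation on the right)."""
    return {letter: second[first[letter]] for letter in LETTERS}


def bottom_row(perm):
    """The lower line of the two-row notation."""
    return "".join(perm[letter] for letter in LETTERS)


r = compose(q, p)    # first p, then q
s = compose(p, q)    # first q, then p
print(bottom_row(r), bottom_row(s), r == s)
```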


Composition of functions 


As another example, we may consider types of binary operations on the 
set of functions. 


If $f(x) = 3 - x$ and $g(x) = \dfrac{x}{2 - x}$
are two functions, then one simple way of combining them is by straightforward
addition:

$$f(x) + g(x) = 3 - x + \frac{x}{2 - x} = \frac{6 - 4x + x^2}{2 - x},$$

this being denoted (f + g)(x).
Again, if f(x) = ln x and g(x) = sin x, the function f + g would be
ln x + sin x. However, we are perhaps more familiar with the composition
of functions by successive substitution, $(f \circ g)(x)$ being defined as f[g(x)]
so that in the latter example above we get the familiar ln (sin x), whilst
in the former case,

$$(f \circ g)(x) = 3 - \frac{x}{2 - x} = \frac{6 - 4x}{2 - x}.$$

Note that this operation $\circ$ is not commutative, for

$$(g \circ f)(x) = \frac{3 - x}{2 - (3 - x)} = \frac{3 - x}{x - 1} \neq (f \circ g)(x).$$
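A quick numerical check of the two orders of composition may be reassuring. The sketch below assumes the readings f(x) = 3 − x and g(x) = x/(2 − x) of the pair of functions used above, and works in exact fractions; the closed forms (6 − 4x)/(2 − x) and (3 − x)/(x − 1) are compared at a few sample points only.

```python
from fractions import Fraction


def f(x):
    return 3 - x


def g(x):
    return x / (2 - x)


def compose(outer, inner):
    """(outer o inner)(x) = outer(inner(x))."""
    return lambda x: outer(inner(x))


fg = compose(f, g)
gf = compose(g, f)

# Sample points avoiding x = 2 (g undefined) and x = 1 (g(f(x)) undefined).
sample = [Fraction(n) for n in (-3, 0, 5, 7)]
for x in sample:
    print(x, fg(x), gf(x))
```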


Exercise 14.2a 


1 Consider the set of permutations of five letters which interchange two 
pairs, such as (AB) (CD), which changes ABCDE into BADCE. Is this set 
closed under composition of permutations?
2 Is the following a valid binary operation on the set of rationals?

$$\frac{a}{b} * \frac{c}{d} = \frac{a + c}{b + d}, \quad \text{where } a, b, c, d \in Z.$$


Is the following a valid binary operation on the set of points in a plane? 
Given P, Q, P * Q is defined as the mirror image of P in OQ, where O is 


a given fixed point. 


3 Find which permutation(s) must be added to the following set in order 
for the set to be closed under composition of permutations: 


123 4 5 6 
1. τ, 5.5 Ὁ 
54 26 58 
23 4 1 6 ὃ 


4 Show that the set of 3 × 3 matrices which have positive determinants
is closed under matrix multiplication. Is it closed under matrix
addition? Consider also the sets of matrices whose determinants are (i) negative,
(ii) zero.


5 Consider the set of 120° rotations of the regular tetrahedron about its 
altitudes. Is this closed under successive composition ? 


6 Is the set {1, 3,5, 9, 13} closed under multiplication mod 14? Is there 
a subset which is closed? 


7 Consider the closure or otherwise of the following number systems
under the operations +, −, ×, ÷:

(i) N; (ii) Z; (iii) $Z^-$; (iv) Q; (v) $Q^+$; (vi) R; (vii) $R^+$;
(viii) C; (ix) the irrationals; (x) cos θ + i sin θ (0 ≤ θ < 2π);
(xi) x + iy (x, y ∈ Z).


8 Is the set of 2 × 2 singular matrices closed under matrix addition,
under matrix multiplication?


9 In the following cases, find (f + g), $f \circ g$, $g \circ f$, $f^2$ and $g^2$:
] l 


@) fe)=— gle)=——3 (γὼ = πα, g(x) = sing; 
τ χ —- 3 
Git) Γι) = SF als) = -αὶ 


(iv) f(x) = 2x − 1, g(x) = 5 − 2x.
10 Is the operation $x * y = x^y$ closed in (i) $Z^+$; (ii) Z; (iii) Q; (iv) $R^+$?


11 Show that functions of the type f(x) = (ax + b)/(cx + d) are closed
under function composition.
12 Construct a finite set of 2 × 2 matrices which is closed under matrix
multiplication.


13 Consider the operations on vectors: (i) vector addition; (ii) scalar
product; (iii) vector product. Which of these are closed and which not
closed (a) in two-dimensional space; (b) in three-dimensional space?


y 
x 
are real. Investigate whether, for the operation of matrix multiplication, 
(1) Sis closed; (71) the identity belongs to $; (111) every element of S 
has an inverse. (S.M.P. ) 


x 
14 ὁ. denotes the set of matrices of the form ᾿ ) where x and y 


Associativity and commutativity 


Once we have a set with a closed binary operation (or possibly more than 
one operation), we have what may be called an algebra. If the existence of 
the operation is the only structural feature, then the algebra is very primi- 
tive. But usually the operation satisfies a number of axioms which make the 
structure more interesting. 

Generally speaking, the most desirable property, apart from closure, of a 
binary operation is that it should be associative, that is

$$\forall\, a, b, c \in S, \quad (a * b) * c = a * (b * c).$$

If this feature is lacking, the structure is unlikely to lead to fruitful mathematical
results (though the operation of vector product is a notable exception).
On the other hand, commutativity,

$$\forall\, a, b \in S, \quad a * b = b * a,$$


can often be dispensed with, and many examples of non-commutative 
operations which are nonetheless mathematically powerful have already 
been encountered. 

It is usually a simple matter to see whether an operation is commutative
by inspection, but often a laborious process to check for associativity. For
example, consider the binary operation on R defined by the formula

$$v_1 * v_2 = \frac{v_1 + v_2}{1 + v_1 v_2}, \quad \text{with } |v_1| < 1,\ |v_2| < 1$$

(a formula used in special relativity theory). We wish to show that * is
associative, i.e., that

$$(v_1 * v_2) * v_3 = v_1 * (v_2 * v_3).$$

Now

$$(v_1 * v_2) * v_3 = \frac{v_1 + v_2}{1 + v_1 v_2} * v_3 = \frac{\dfrac{v_1 + v_2}{1 + v_1 v_2} + v_3}{1 + \dfrac{(v_1 + v_2) v_3}{1 + v_1 v_2}} = \frac{v_1 + v_2 + v_3 + v_1 v_2 v_3}{1 + v_1 v_2 + v_1 v_3 + v_2 v_3}$$

and

$$v_1 * (v_2 * v_3) = v_1 * \frac{v_2 + v_3}{1 + v_2 v_3} = \frac{v_1 + \dfrac{v_2 + v_3}{1 + v_2 v_3}}{1 + \dfrac{v_1 (v_2 + v_3)}{1 + v_2 v_3}} = \frac{v_1 + v_2 + v_3 + v_1 v_2 v_3}{1 + v_1 v_2 + v_1 v_3 + v_2 v_3},$$

so the operation is associative, as well as being obviously commutative.
On the other hand, the operation $x \sim y = |x - y|$ is commutative but
not associative, as may be confirmed by a single counter-example: e.g., if
x = 2, y = 1, z = 4,
then $(x \sim y) \sim z = |2 - 1| \sim 4 = 1 \sim 4 = |1 - 4| = 3$,
but $x \sim (y \sim z) = 2 \sim |1 - 4| = 2 \sim 3 = |2 - 3| = 1$.
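Both conclusions can be spot-checked by brute force. This is a sketch under stated assumptions: the sample of speeds (tenths between −0.9 and 0.9) is our own choice, exact fractions are used so that equality is meaningful, and an empty list of failures on a finite sample is supporting evidence rather than a proof.

```python
from fractions import Fraction
from itertools import product


def add_velocities(v1, v2):
    """The relativistic composition v1 * v2 = (v1 + v2) / (1 + v1 v2)."""
    return (v1 + v2) / (1 + v1 * v2)


def absolute_difference(x, y):
    """The operation x ~ y = |x - y|."""
    return abs(x - y)


def associativity_failures(op, sample):
    """All triples (a, b, c) in the sample with (a*b)*c != a*(b*c)."""
    return [
        (a, b, c)
        for a, b, c in product(sample, repeat=3)
        if op(op(a, b), c) != op(a, op(b, c))
    ]


speeds = [Fraction(n, 10) for n in range(-9, 10)]    # all satisfy |v| < 1
print(associativity_failures(add_velocities, speeds))
print(associativity_failures(absolute_difference, [1, 2, 4]))
```

The second list contains the triple (2, 1, 4) used in the text, among others.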


Usually there is no short cut to discovering whether or not a particular 
operation is associative and it cannot be ascertained by inspection. How- 
ever, one should remember that associativity is always guaranteed in the 
following cases: 

(i) multiplication and addition in a number system, including finite
arithmetics (see 14.3);

(ii) composition of mappings (geometrical transformations, permutations,
matrices, vectors, etc.).


Exercise 14.2b 
1 State whether each of the operations in exercises 14.2a, nos. 2, 10 are:

(a) commutative; (b) associative.


2 Discover which of the operations on the reals described by the following
formulae are (a) commutative, (b) associative:


χῃ -- ἰ Ϊ — xy 
(1) y ag ay (x+y ) 
(ii) x * y = k(x + y); (iii) x * y = |x − y|; (iv) x * y = x;
(v) x * y = x/y (x, y ≠ 0); (vi) x * y = x × |y|;


(vii) $x * y = x^y$.


3 Show that the non-commutative operation shown in the table is not
associative:
4 Show that the operation * on the set {a, b, c, d, e} where products are
given in the following table, is not associative:


5 Show that, for vectors in three-dimensional space, vector addition is 
associative, but that vector product is not. 


6 Prove the associativity of the following law of composition for pairs of
reals:

$$(x_1, y_1) * (x_2, y_2) = (x_1 x_2 - y_1 y_2,\; x_1 y_2 + x_2 y_1).$$

7 Show that the operation on ordered pairs of reals defined by

$$(x_1, y_1) * (x_2, y_2) = (x_1 x_2,\; x_2 y_1 + y_2)$$

is associative, and find whether the operation on ordered triples defined by

$$(x_1, y_1, z_1) * (x_2, y_2, z_2) = (x_1 + x_2 + y_2 z_1,\; y_1 + y_2,\; z_1 + z_2)$$

is or is not associative.

8 Show that the operation on R expressed by the formula

$$x * y = \frac{xy + k}{x + y} \quad (x + y \neq 0)$$

is associative for all k, and suggest some interpretations for special values of k.


g The operation ois defined on the number pairs A = (α,. 44) B = (ὁ... dg) 
so that Ao B = (a, + b,, a, + 6,) and A = B if and only ifa, = ὃ, and 
$a_2 = b_2$. Find whether the operation is associative.

Find P such that $A \circ P = A$, and Q such that $Q \circ A = A$, and determine
whether I, J can be found such that, for all A, $A \circ I = A$ and $J \circ A = A$.

Given that $A^* = (-a_1, -a_2)$, discuss the following:

$$B \circ A = C \circ A \;\Rightarrow\; B \circ A \circ A^* = C \circ A \circ A^* \;\Rightarrow\; B \circ (0, 0) = C \circ (0, 0) \;\Rightarrow\; B = C. \quad \text{(M.E.I.)}$$


Identity and inverse 


Before we are ready to talk about groups, we must consider two other ideas, 
those of identity and inverse. If a set S is provided with an operation * and
there exists a fixed element e of S which combines with any element of S to
leave it unchanged, i.e.,

$$\forall\, x \in S, \quad x * e = x \ \text{(right identity)} \quad \text{and} \quad e * x = x \ \text{(left identity)},$$

then e is described as an identity element for the set. For example, when
the operation is addition in the set of integers, the identity is 0; while for
multiplication, it is 1. These are the two numbers, 0 and 1, which play
special parts in the axioms for a field (see 14.1). For addition in the set of
2 × 2 matrices, the identity is $\begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}$, the 'null' matrix; for
multiplication it is $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$, the 'unit' matrix. Note that
$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$ is a
left identity for the set of all 3 × n matrices, but a right identity for the set
of all n × 3 matrices, and both a left and right identity only for square
3 × 3 matrices. Similarly, we have already encountered the identity
transformation in a set of transformations, the identity permutation and the identity
function. In the case of subtraction on the set Z (or Q or R), we have a right
identity x − 0 = x (for all x), but no left identity; similarly there is a right-
but no left-identity in the case of division and also exponentiation. When we
speak of 'the identity element' in future, we shall be concerned with
associative closed operations having an identity which is both a left- and
right-identity.

The concept of inverse is one that has already arisen in several situations
during the course of this book. The general idea is that of 'undoing' an
operation, i.e., of finding the element y which combines with x to give the
identity, thus

$$x * y = e \ \text{(y is a right inverse)}, \qquad y * x = e \ \text{(y is a left inverse)}.$$

For example, in the set Z with the operation + (the identity being 0) we
have

$$x + (-x) = 0 \quad \text{and} \quad (-x) + x = 0,$$

so that −x, or the negative of x, is the inverse of x under addition. In the
set Q with the operation × (the identity being 1) we have x × 1/x = 1
(x ≠ 0), and the multiplicative inverse is the reciprocal, and written $x^{-1}$.
We have considered the inverse of a matrix under matrix multiplication in
some detail, and we use the same notation, $M^{-1}$, to denote the inverse of
a non-singular matrix M, satisfying $MM^{-1} = I = M^{-1}M$ (see 13.4).

A similar notation is used in the case of inverses of geometrical
transformations, permutations and functions. For example, if R is an anti-clockwise
rotation through 40° about O, then the inverse of this, a clockwise 40°
rotation about O, would be denoted $R^{-1}$. Again, if x is the permutation

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 5 & 1 & 6 & 4 & 2 & 3 \end{pmatrix},$$

then the inverse permutation may be written

$$\begin{pmatrix} 5 & 1 & 6 & 4 & 2 & 3 \\ 1 & 2 & 3 & 4 & 5 & 6 \end{pmatrix} \quad \text{or, on rearranging,} \quad \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 2 & 5 & 6 & 4 & 1 & 3 \end{pmatrix},$$

and is denoted $x^{-1}$.

You should check that the result of combining x with $x^{-1}$ either way
round is to produce the identity permutation.

Lastly, we have also considered (see 1.1, Book 1, Advanced Mathematics)
the inverse of a function or mapping. Taking a particular example,
if $f: x \mapsto \dfrac{x - 1}{2 - x}$, we wish to specify the inverse $f^{-1}(x)$.

Calling $\dfrac{x - 1}{2 - x} = y$

$$\Rightarrow\; 2y - xy = x - 1 \;\Rightarrow\; x = \frac{2y + 1}{y + 1},$$

so the required inverse function is $f^{-1}(x) = \dfrac{2x + 1}{x + 1}$.

You should check that the 'composition' of the functions f and $f^{-1}$, in
either order, does in fact give the identity function $i: x \mapsto x$.

Finally we notice that if x and y are two elements of a set which combine
under an associative binary operation *, then the inverse of x * y is $y^{-1} * x^{-1}$;
this may be abbreviated by dropping the * and using the 'juxtaposition', or
multiplicative notation of ordinary algebra:

$$(xy)^{-1} = y^{-1} x^{-1}.$$

The proof is straightforward:

$$(xy)(y^{-1} x^{-1}) = x(y\,y^{-1} x^{-1}) = x((y\,y^{-1}) x^{-1}) = x(e\,x^{-1}) = x\,x^{-1} = e.$$

(Note the repeated use of associativity.)

Similarly, $(y^{-1} x^{-1})(xy) = e$, so that xy and $y^{-1} x^{-1}$ are inverse elements
(just as the inverse of the combined operation of putting on one's socks and
then one's shoes is the operation of first removing the shoes, then removing
the socks). Evidently the rule may be extended to

$$(xyz)^{-1} = z^{-1} y^{-1} x^{-1}, \text{ etc.}$$
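The rule for the inverse of a product can be tried on permutations. In the sketch below x is the permutation of the digits 1 to 6 with bottom row 5 1 6 4 2 3 discussed earlier; the second permutation y is a hypothetical choice of ours, picked so that x and y do not commute. Products are read as in the text: xy means first y, then x.

```python
DIGITS = range(1, 7)
x = dict(zip(DIGITS, (5, 1, 6, 4, 2, 3)))
y = dict(zip(DIGITS, (4, 6, 5, 2, 1, 3)))    # hypothetical second permutation
identity = {k: k for k in DIGITS}


def compose(second, first):
    """The product second.first: apply `first`, then `second`."""
    return {k: second[first[k]] for k in DIGITS}


def inverse(perm):
    """Swap the two rows of the permutation."""
    return {image: k for k, image in perm.items()}


print(inverse(x))
print(compose(x, inverse(x)) == identity, compose(inverse(x), x) == identity)
print(inverse(compose(x, y)) == compose(inverse(y), inverse(x)))
print(inverse(compose(x, y)) == compose(inverse(x), inverse(y)))
```

The last comparison fails for this pair, which shows that the reversal of order matters whenever the two elements do not commute.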
Exercise 14.2c


1 Find the identity elements for sets under the operations ∩ and under ∪
(see 0.2 and 14.6), and show that it is impossible to find inverses for these
operations. Find an operation which does admit inverses.


2 (i) What must be included in the set {4, 8} in order to achieve closure
under multiplication mod 14? What is the identity element, and what is the
inverse of 4?

(ii) Which is the identity element in the system {2, 4, 8, 10, 14, 16} under
multiplication mod 18?

(iii) Show that {2, 4, 6, 8, 12, 14, 16, 18} is closed under multiplication
mod 20, but has no identity.


3 Find:

(i) the inverse of 27 and of 12 under addition mod 40 of {0, 1, 2, ..., 39};
(ii) the inverse of 17 under multiplication mod 20 of {0, 1, 2, ..., 19};
(iii) the inverse of 53 under multiplication mod 100 of {0, 1, 2, ..., 99}.


4 Find identity elements where they exist in operations described in 
14.2b, 2, 3, 4, 5, 6, 7, 8, 9. 


5 Find the inverses of the matrices under matrix multiplication, and 
interpret geometrically where possible: 


af 3 ay. . (06 0.8). ee 
4) © > My a a " (| 7 


O O | —-5 -πΊ 38 
(v) Jl O OF; (v) eee ae |e 
0-1 0 Ib 2, Τῷ 


(vi) cos 80° —sin 80° 
n sin 80° οο5 80° / 


6 Find the inverses of the following functions: 


. l sig, ee Ste oe ae = ἦν τῆν, 
(a l= τ’ (11) πε, (111) πο", ΣῊ 
3x = 7 6s 
(v) αἷ: (vi) ——~; (wii) log, (log,x). 
4 ΞΞ ὃ 


7 Find the inverses of the permutations 


< ft 2-3-4 Sy a fl Oe Od, GREY 
Coa 4 51 4] CN ie. δ ἐδ. 1. ὦ ἃ} 


ΟΠ 1 23 4567 
προ οι Boy τ 5. 
Verify in each case that $x \circ x^{-1}$ and $x^{-1} \circ x$ are both the identity
permutation.
8 Verify the rule $(xy)^{-1} = y^{-1} x^{-1}$ in the case when x and y are the
permutations


ἢ 2345 wl ὩΣ Bas 
a Wh. ap ee eG ΠΥ 5 1.3.2) 
9 Let S be the set of 2 x 2 matrices with elements in {0, 1, 2, 3, 4, 5}. If 


A ∈ S and A has an inverse in S, show that det A = 1 or 9. (J.M.B.)


10 Show that the set of matrices of the form

$$\begin{pmatrix} a & b & c \\ c & a & b \\ b & c & a \end{pmatrix} \quad (a, b, c \in R)$$

is closed under matrix multiplication, and that the identity is in the set.
Under what circumstances do we have inverses?


11 Let $\oplus$ and $\otimes$ be two binary operations (rules of composition) defined
on the field Q of rational numbers, as follows:

$$a \oplus b = a + b - 1, \qquad a \otimes b = a + b - ab.$$

Find the identity element, and the inverse of any number a, for each of these
operations.

Find all functions $f: Q \to Q$ which simultaneously satisfy the equations

$$f(a + b) = f(a) \oplus f(b) \quad \text{and} \quad f(ab) = f(a) \otimes f(b). \quad \text{(M.E.I.)}$$


14.3 Groups 


We are now ready to describe the vitally important structure known as a 
group, but before giving a general definition we shall first take two examples. 


Example 1 


Let us consider the set of integers {1, 3, 7, 9} and their products modulo 10 
(i.e., the remainders, or residues, when their ordinary products are divided 
by the modulus 10). 

As 3 × 7 = 21, 3 × 9 = 27, 9 × 7 = 63 etc., we may write 3 × 7 ≡ 1,
3 × 9 ≡ 7, 9 × 7 ≡ 3 (mod 10), and these products can be summarised:


Table 1

  × | 1  3  7  9
 ---+------------
  1 | 1  3  7  9
  3 | 3  9  1  7
  7 | 7  1  9  3
  9 | 9  7  3  1

The system has the following features. First, it is closed. Had we taken the
set {3, 5, 7}, this would have failed for closure, for products such as 3 × 7
would not have been contained inside the set. Second, the operation of
multiplication is undoubtedly associative. Third, we have an identity
element, 1. And fourth, every element has an inverse; e.g. since 3 × 7 ≡ 1,
the residues 3 and 7 are a pair of inverses. (This would not have been so 
had the number 5 been included, for the element 5 would have no inverse 
though closure would still have held good.) A system which passes these 
four tests — of closure, associativity, identity and inverses — is called a group, 


and as the present example has four elements, it is said to be a group of 
order 4. 
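The four tests can be applied by brute force to any small finite system. The following is a minimal sketch; the function `group_report` and its dictionary of results are our own devices, not a standard library facility. It is applied to {1, 3, 7, 9} and to the two variations mentioned above.

```python
def mul_mod_10(a, b):
    return (a * b) % 10


def group_report(elements, op):
    """Brute-force check of closure, associativity, identity and inverses."""
    elements = list(elements)
    closed = all(op(a, b) in elements for a in elements for b in elements)
    associative = all(
        op(op(a, b), c) == op(a, op(b, c))
        for a in elements for b in elements for c in elements
    )
    identities = [e for e in elements
                  if all(op(e, a) == a == op(a, e) for a in elements)]
    identity = identities[0] if identities else None
    inverses = identity is not None and all(
        any(op(a, b) == identity == op(b, a) for b in elements)
        for a in elements
    )
    return {"closed": closed, "associative": associative,
            "identity": identity, "inverses": inverses}


print(group_report([1, 3, 7, 9], mul_mod_10))
print(group_report([3, 5, 7], mul_mod_10))          # fails closure
print(group_report([1, 3, 5, 7, 9], mul_mod_10))    # closed, but 5 has no inverse
```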


Example 2 


We now give an example of an infinite group, provided by the set of all 
non-singular 2 × 2 matrices with real elements under the operation of matrix
multiplication.


It is immediately clear that


(i) The product of two such matrices with real elements is another such
matrix, so closure is established.


(ii) Multiplication of matrices is known to be associative.


(iii) The set includes the unit matrix $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$.

(iv) Every non-singular 2 × 2 matrix has an inverse which is also in the
set.


So the fourfold requirements for a group are satisfied.


This time we have not only an infinite group, but one in which the binary
operation is non-commutative, since matrices do not in general commute under
multiplication. By contrast, our first example† {1, 3, 7, 9}, × (mod 10)
was a finite commutative group. Such a group, in which every product is
commutative, is called Abelian, after the Norwegian pioneer of group theory,
N. H. Abel (1802-29). Note that, within non-Abelian groups, it is possible
for some products to commute: the identity will always commute with
every element; and in the group of non-singular 2 × 2 matrices under
multiplication, we note that,


(> ἡ(. ἡ-:.ἡτί ὦ 


Having provided these two very dissimilar examples, we are now ready 
to give our general definition: 


† We shall specify a group by stating first the set, then the binary operation, these being
separated by a comma.

A group is a set G with a binary operation * subject to the following
requirements:


* is closed in G, i.e., for all x, y ∈ G, x * y ∈ G


* is associative, i.e., for all x, y, z ∈ G, (x * y) * z = x * (y * z)


There is an identity or neutral element e satisfying
x * e = e * x = x, for all x ∈ G

Each element x has an inverse, $x^{-1}$, satisfying
$x * x^{-1} = x^{-1} * x = e$


If we now use the symbols ∀ to denote 'for all', ∃ to denote 'there exists'
and : to denote 'such that', these four axioms of a group G, * may be
conveniently summarised:

$$\forall\, x, y \in G, \quad x * y \in G$$
$$\forall\, x, y, z \in G, \quad (x * y) * z = x * (y * z)$$
$$\exists\, e \in G : \forall\, x \in G, \quad x * e = e * x = x$$
$$\forall\, x \in G, \ \exists\, x^{-1} : \quad x * x^{-1} = x^{-1} * x = e$$


If you refer to the axioms for a field (see 14.1), you will notice that
A1-4 and M1-4 are equivalent to stating that we should have a group
under addition and under multiplication, always remembering that the
'zero' element (the identity for addition) must be removed in the case of
the multiplicative group, since it has no multiplicative inverse. Thus the
rationals, Q, are a group under addition, and also a group under
multiplication, provided 0 is excluded (just as all singular matrices had to be
expunged from the set before group structure of 2 × 2 matrices could be
claimed).


Exercise 14.3a 


1 Show that the following are finite groups and state the order of the 
group in each case 
(i) integers with a specified number of binary digits under the operation
of addition mod 2 with no carrying (e.g., 011 * 110 = 101);
(ii) Role Hoe

x+tl« l—-x
under function composition;
(iii) all the 120 permutations of five letters;
(iv) the set {0, 1, 2, ..., n − 1} under addition mod n;
(v) the set {1, 2, 4, 7, 8, 11, 13, 14} under multiplication mod 15;
(vi) all the symmetries (rotations and reflections) of the rectangle; the
square; the equilateral triangle, the cuboid, etc.

2 Show that the following† are infinite groups

() Z, +5 (ὦ αὶ +3; (at) R*, x; (ὦ) Q*, χ; (v) Χ φ matrices, 
under matrix addition (suggest numbers systems from which the elements 
might be drawn) ; 

(vi) n × n non-singular matrices under matrix multiplication;

(vii) all isometries in the plane (i.e., transformations which preserve
distance) under successive composition;

(viii) vectors in 2D space under vector addition;

(ix) all integral powers of 3 under multiplication.


3 Find subsets of the examples in no. 1 and no. 2 above which are them- 
selves groups (called subgroups) under the operation mentioned. 


4 Say why the following fail to be groups: 
(i) odd integers under addition;

(ii) integers under multiplication;

(iii) sets under the binary operation 'intersection';

(iv) the negative rationals, under addition;

(v) $\{a + b\sqrt{2} + c\sqrt{3}\}$ (a, b, c ∈ Z), under multiplication;

(vi) $\{a + b\sqrt{2} + c\sqrt{3}\}$ (a, b, c ∈ Q), under multiplication;


Δ th tag | 2 5. 4 I <2. 3.4 ess ate: 
vit) the permutations 214 3 3419 4.5 α 1] 


(viii) {1, −1, i} under multiplication;
(ix) all irrationals together with zero, under addition.


5 Show that the first table is a group table, but the second is not. In the
third, fill in the blanks in the group table, given that it is Abelian, and that
every element (say g) satisfies $g^3 = 1$ (identity).


abcde 


lpqrstuvw 


lpqrstuvuw 


pqit 
φιὶιρ. 


ΞΟ ™ & ~*~ QQ Se καὶ 


Ξ 


6 Show that the set {x, 1, −x, −1} of linear polynomials in x, when
multiplied modulo ($x^2 + 1$), is a group, and write out the group table.


7 Show that the matrices $\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$ are a group under matrix
multiplication. What is the order of the group when θ = 15°?


† See Notation (p. xiii).
8 The set S consists of all transformations T of the real numbers given by
T(x) = ax + b (a ≠ 0, a, b real).
Show that if $T_1, T_2 \in S$, then $T_1 T_2 \in S$. Prove that the set S forms a group
under composition of functions. Find the elements T of S of order 2 (i.e.,
TT = I and T ≠ I where I is the identity transformation). Prove that
these elements are all reflections. Show that there are no elements of order
3 in S. (J.M.B.)


Ϊ 
9 Let G be the set of all matrices of the form ᾿ ᾿ with x a real 


number. 

(i) Show that G does not form a group under matrix addition.

(ii) Show that G does form a group under matrix multiplication. 

(You may assume associativity of matrix addition and multiplication.) 
(J.M.B. ) 


10 G is a finite group, and g is an element other than the identity. A set S
consists of all positive integral powers of g (so that $S = \{g, g^2, g^3, \ldots\}$).
Prove that: (i) the identity belongs to S; (ii) $g^{-1}$ belongs to S.

Give an example to show that neither of these need be true if the group is
infinite. (S.M.P.)


We now resume the illustration of groups in a variety of contexts. 


Infinite groups 


Other examples of infinite groups include the set of plane rotations and 
translations (see p. 478), the law of composition being as described there ; in 
three dimensions, the set of transformations (called isometries), such as 
rotations, translations and reflections, which preserve distance between all 
pairs of points; and also the set of vectors in three-dimensional space under 
vector addition. In each case, it should be confirmed that the four require- 
ments for a group are satisfied. 


Finite groups


Turning to groups with a finite number of elements, we saw at the beginning 
of this section that there is a useful way of displaying all the products of 
elements in the group in the form of a table, and this is valuable in the case
of small groups. For example, the matrices 


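$$I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad X = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}, \quad Y = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad Z = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}$$

under multiplication combine according to the entries of

Table 2

    | I  X  Y  Z
 ---+------------
  I | I  X  Y  Z
  X | X  I  Z  Y
  Y | Y  Z  I  X
  Z | Z  Y  X  I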


and this describes the group completely. It will be recalled that precisely 
the same table arose (p. 480) when we combined the half-turns of the rec- 
tangular box, so that the table serves to describe the composition of the 
various rotational symmetries of the cuboid. 


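By contrast, a quite different table arises when the four matrices

$$I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad R = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \quad R^2 = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, \quad R^3 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$$

combine under multiplication:


Table 3

     | I   R   R²  R³
 ----+----------------
  I  | I   R   R²  R³
  R  | R   R²  R³  I
  R² | R²  R³  I   R
  R³ | R³  I   R   R²


These matrices correspond to rotations about the origin through angles
0, ½π, π, 3π/2, and so the table describes the rotational symmetries of a
figure like

[Figure: a plane figure with rotational symmetry of order 4]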
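One way to see that the two tables are genuinely different is to compare the orders of the elements. The sketch below multiplies 2 × 2 integer matrices directly; the four diagonal matrices with entries ±1 are our assumed representation of the first set, and the second set consists of the powers of the quarter-turn matrix R.

```python
I = ((1, 0), (0, 1))
R = ((0, -1), (1, 0))     # rotation through a right angle about the origin


def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def order(m):
    """Smallest n >= 1 with m^n = I (assumes such an n exists)."""
    n, power = 1, m
    while power != I:
        power = matmul(power, m)
        n += 1
    return n


rotations = [I, R, matmul(R, R), matmul(matmul(R, R), R)]
diagonal = [I, ((-1, 0), (0, 1)), ((1, 0), (0, -1)), ((-1, 0), (0, -1))]

print([order(m) for m in rotations])
print([order(m) for m in diagonal])
```

The rotations contain elements of order 4, whereas every diagonal matrix other than I has order 2, so no relabelling can turn one table into the other.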


Permutation groups 

Next we consider a group of permutations.

Let x be the permutation

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 \\ 2 & 3 & 4 & 5 & 1 \end{pmatrix},$$

i.e., x replaces 1 by 2, 2 by 3, ..., 5 by 1.
It is usual to denote such a 'cycle' by (1 2 3 4 5), each digit in the cycle
enclosed in brackets being replaced by the succeeding one, the cycle being
completed by the replacement of the final digit by the first. (So that this
particular cycle might also be denoted as (4 5 1 2 3), and in various
other ways). Similarly, the permutation

$$y = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 4 & 6 & 5 & 2 & 1 & 3 \end{pmatrix}$$

can be shown in cycle notation as y = (1 4 2 6 3 5); while the permutation

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 5 & 3 & 6 & 4 & 1 & 2 \end{pmatrix}$$

contains two cycles, a = (1 5), and p = (2 3 6), the digit 4 being
unchanged by the permutation. We also see that the cycle x = (1 2 3 4 5),
when applied repeatedly to the five digits, changes the line successively as
follows:

1 2 3 4 5 → 2 3 4 5 1 → 3 4 5 1 2 → 4 5 1 2 3 → 5 1 2 3 4 → 1 2 3 4 5,

so five applications of the permutation x lead to the identity permutation, and
we write x * x * x * x * x = e.

From now on we shall usually drop the symbol *, or whatever sign is 
used for the binary operation, and use juxtaposition to denote the composition 
of two elements: thus x * y will be abbreviated to xy (first y, then x). By 
analogy with algebra, we also use $x^5$ to abbreviate xxxxx and, since we are
now committed to a multiplicative notation, we shall henceforward use 1 for
the identity element. Thus, in the above, $x^5 = 1$ describes the behaviour of
the cycle x = (1 2 3 4 5).

Evidently, since

$$x^2 = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 \\ 3 & 4 & 5 & 1 & 2 \end{pmatrix},$$

we may write $x^2$ in cycle notation as (1 3 5 2 4), though it is easy enough
to do this without first writing down the cumbersome replacement notation.
The permutations 1, x, $x^2$, $x^3$, $x^4$ combine according to


Table 4

     | 1   x   x²  x³  x⁴
 ----+--------------------
  1  | 1   x   x²  x³  x⁴
  x  | x   x²  x³  x⁴  1
  x² | x²  x³  x⁴  1   x
  x³ | x³  x⁴  1   x   x²
  x⁴ | x⁴  1   x   x²  x³

and this particular type of group, generated by a single cycle, is called a
cyclic group and denoted $C_5$. In a similar way, the permutation

$$y = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 4 & 6 & 5 & 2 & 1 & 3 \end{pmatrix} = (1\ 4\ 2\ 6\ 3\ 5)$$

generates a cyclic group, this time containing 6 elements, or a group of
order 6, called $C_6$. More generally, the cyclic group of order n is
abbreviated $C_n$.
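Cycle notation lends itself to a short program. The sketch below builds a permutation from a cycle, raises it to powers, and reads the cycles back off; the helper names are our own, and products follow the convention of the text (the right-hand factor is applied first).

```python
def from_cycle(cycle, n):
    """Permutation of 1..n sending each entry of `cycle` to the next one."""
    perm = {k: k for k in range(1, n + 1)}
    for a, b in zip(cycle, cycle[1:] + cycle[:1]):
        perm[a] = b
    return perm


def compose(second, first):
    return {k: second[first[k]] for k in first}


def power(perm, n):
    result = {k: k for k in perm}
    for _ in range(n):
        result = compose(perm, result)
    return result


def order(perm):
    n = 1
    while power(perm, n) != {k: k for k in perm}:
        n += 1
    return n


def cycles(perm):
    """Cycles of length > 1, each started at its smallest digit."""
    seen, found = set(), []
    for start in sorted(perm):
        if start in seen:
            continue
        cycle, k = [], start
        while k not in seen:
            seen.add(k)
            cycle.append(k)
            k = perm[k]
        if len(cycle) > 1:
            found.append(tuple(cycle))
    return found


x = from_cycle((1, 2, 3, 4, 5), 5)
y = from_cycle((1, 4, 2, 6, 3, 5), 6)
mixed = {1: 5, 2: 3, 3: 6, 4: 4, 5: 1, 6: 2}

print(cycles(power(x, 2)), order(x), order(y), cycles(mixed))
```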


Finite arithmetic 


Suppose in the case of the permutation x in the previous paragraph we had 
tabulated not 1, x, $x^2$, $x^3$, and $x^4$, but the 'indices' (i.e., the number of times
the permutation x had been applied, namely 0, 1, 2, 3, and 4). Then we
obtain


Table 5

  + | 0  1  2  3  4
 ---+---------------
  0 | 0  1  2  3  4
  1 | 1  2  3  4  0
  2 | 2  3  4  0  1
  3 | 3  4  0  1  2
  4 | 4  0  1  2  3


Here, for example, $x^4 x^3 = x^7 = x^2$ (as $x^5 = 1$). So far as the indices are concerned, we
may say 4 + 3 = 7, but since any superfluous 5's are of no interest, we
may work modulo 5, and declare that 4 + 3 ≡ 2 (mod 5). Thus in Table 5
above, the operation is addition mod 5 and we have an example of a finite 
arithmetic providing us with a group. 

More interesting groups arise in finite arithmetics when the operation 
is multiplication to some modulus: for example, the group of order 4 at the 
beginning of this section. Another simple example is provided by the set 
{0, 1, 2, 3, 4} under multiplication mod 5, when we obtain 


Table 6

  × | 0  1  2  3  4
 ---+---------------
  0 | 0  0  0  0  0
  1 | 0  1  2  3  4
  2 | 0  2  4  1  3
  3 | 0  3  1  4  2
  4 | 0  4  3  2  1

Here there is an identity element 1, but the element 0 has no inverse, so
the structure is not a group. If, however, 0 is deleted from the set, we do
have group structure, described by


Table 7

 ×₅ | 1  2  3  4
 ---+------------
  1 | 1  2  3  4
  2 | 2  4  1  3
  3 | 3  1  4  2
  4 | 4  3  2  1


Sometimes with finite arithmetic under multiplication it is necessary to
delete more than merely 0 from the set. For example, working mod 6 on
the set {1, 2, 3, 4, 5}, we arrive at


Table 8

 ×₆ | 1  2  3  4  5
 ---+---------------
  1 | 1  2  3  4  5
  2 | 2  4  0  2  4
  3 | 3  0  3  0  3
  4 | 4  2  0  4  2
  5 | 5  4  3  2  1


and we see that, with identity 1, the only elements which have inverses are
1 and 5. So we must omit 2, 3 and 4 to obtain the set {1, 5} with only two
elements forming a group under multiplication mod 6. Nevertheless, the
set {2, 4} also forms a group under ×₆, only here the 'identity' is 4:

Table 9 
x,| 4 2 
4,4 2 
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Again, working mod 15, the set {1, 2, 4, 8} is closed under multiplication: 


Table 10


But it is more surprising to discover the set {3, 6, 9, 12} which has 6 as its 
identity under multiplication mod 15: 


Table 11 


× mod 15 |  6   3   9  12
---------+---------------
    6    |  6   3   9  12
    3    |  3   9  12   6
    9    |  9  12   6   3
   12    | 12   6   3   9


Note that in Table 10, the elements 1, 2, 4, 8 can be written as $1, 2, 2^2, 2^3$,
with $2^4 \equiv 1$ (mod 15), so there is a certain resemblance to the situation
arising from $R^4 = I$ in Table 3. Similarly in Table 11, the elements may
be written $3, 3^2, 3^3$ (= 12), $3^4 = 81 \equiv 6$ (mod 15).
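
The successive powers of an element to a given modulus are easy to list. The sketch below is an illustration only; the function name `powers` is hypothetical.

```python
def powers(g, n):
    """List g, g^2, g^3, ... (mod n), stopping just before a value repeats."""
    seq, x = [], g % n
    while x not in seq:
        seq.append(x)
        x = (x * g) % n
    return seq

print(powers(2, 15))  # [2, 4, 8, 1]
print(powers(3, 15))  # [3, 9, 12, 6]
```

In the first list the powers end at 1, the identity of Table 10; in the second they end at 6, which plays the part of the identity in Table 11.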


Symmetry groups 


The finite groups illustrated above have been drawn mostly from groups of 
permutations and from finite arithmetics—groups of residues to some 
modulus under addition and multiplication. But there have also been cases 
of groups of matrices and groups of symmetries, and we shall now have a 
little more to say about the latter. 

It was clear on p. 494 that a plane figure possessing rotational symmetry
gave rise to a group of order 4. Similarly, a figure with rotational symmetry
of order 9, i.e., which is unchanged by rotations through multiples of 40°,
as in the case of the ‘plate’ illustrated, will have a symmetry group whose
elements are rotations through 0°, 40°, 80°, ..., 280°, 320°.

Calling r an anti-clockwise rotation through 40°, the elements of the group
($C_9$) will be $1, r, r^2, r^3, r^4, r^5, r^6, r^7, r^8$; where, for example,
$r^7 r^5 = r^3$, indices being added modulo 9, since $r^9 = 1$. It is clear that
such rotations may be represented by permutations of the vertices; for
example, rotation r replaces the vertices 0, 1, 2, ..., 8 by 8, 0, 1, ..., 7
respectively, and so corresponds to the cycle (0 8 7 6 5 4 3 2 1). Moreover,
the same set of rotations may be represented by matrices; r, for example,
being represented by

$$\begin{pmatrix} \cos 40^\circ & -\sin 40^\circ \\ \sin 40^\circ & \cos 40^\circ \end{pmatrix}.$$

So we have a set of nine matrices forming the group of order 9 under matrix
multiplication. Again, in the Argand diagram, rotation through 40° is the
geometrical equivalent of multiplication by the complex number cis 40°.
Calling this complex number $\omega$, we have the nine complex numbers
$\{1, \omega, \omega^2, \omega^3, \omega^4, \omega^5, \omega^6, \omega^7, \omega^8\}$ forming the group of order 9 under
multiplication, e.g., $\omega^5 \times \omega^7 = \omega^3$ (remembering $\omega^9 = 1$). These are the
nine complex ninth roots of unity (cf. p. 135).
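
A numerical check of the complex representation may be sketched as follows. The variable names and the tolerance `1e-9` are arbitrary choices made for this illustration; floating-point arithmetic means that equalities are tested only approximately.

```python
import cmath
import math

omega = cmath.rect(1.0, math.radians(40))   # the complex number cis 40°
roots = [omega ** k for k in range(9)]      # 1, omega, ..., omega^8

def close(z, w, tol=1e-9):
    """Approximate equality for complex numbers."""
    return abs(z - w) < tol

# Each element is a ninth root of unity.
print(all(close(z ** 9, 1) for z in roots))

# Multiplying two elements adds their indices modulo 9.
print(all(close(roots[j] * roots[k], roots[(j + k) % 9])
          for j in range(9) for k in range(9)))
```

Both lines should print `True`, reflecting the fact that the indices behave exactly like the integers under addition mod 9.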


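Groups also appear in the case of sets of functions under function
composition (see p. 481). For example, if

$$f(x) = \frac{x - 1}{x + 1},$$

then

$$f^2(x) = \frac{\dfrac{x-1}{x+1} - 1}{\dfrac{x-1}{x+1} + 1} = \frac{x - 1 - x - 1}{x - 1 + x + 1} = -\frac{1}{x},$$

$$f^3(x) = \frac{-\dfrac{1}{x} - 1}{-\dfrac{1}{x} + 1} = \frac{1 + x}{1 - x},$$

and finally, $f^4(x) = x = i(x)$, where $i$ is the identity function.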
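
The repeated composition can be spot-checked with exact rational arithmetic. This is only a sketch, assuming that f is the function f(x) = (x − 1)/(x + 1); the helper `iterate` and the sample values are hypothetical choices, picked to avoid the points −1, 0 and 1 where one of the compositions is undefined.

```python
from fractions import Fraction

def f(x):
    """The assumed function f(x) = (x - 1)/(x + 1)."""
    return (x - 1) / (x + 1)

def iterate(func, times, x):
    """Apply func to x the given number of times."""
    for _ in range(times):
        x = func(x)
    return x

samples = [Fraction(2), Fraction(3, 7), Fraction(-5, 2)]

for x in samples:
    print(x, iterate(f, 2, x) == -1 / x,
          iterate(f, 3, x) == (1 + x) / (1 - x),
          iterate(f, 4, x) == x)
```

Agreement at a few sample points is of course not a proof, but it is a quick safeguard against slips in the algebra.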
So these four functions: $i, f, f^2, f^3$ combine according to


Table 12 


Cyclic groups 


Apart from the group of Table 2, all the finite groups mentioned so far
have been cyclic groups. This means that all the elements of the groups may
be expressed in terms of a single element, or generator (as it is called).
Thus in the case of the functions in Table 12 above, the elements were
$i, f, f^2, f^3$, where $i$ might be expressed as $f^0$, $f$ is of course $f^1$, and
$f^3$ might have been written $f^{-1}$, the group being governed by the
over-riding relation $f^4 = i$. Indeed, whenever (as in Table 3) we have a
situation of the type $x^4 = 1$, there we must have the cyclic group of
order 4, usually called $C_4$; and the same group would arise in the
multiplication of the set $\{1, i, -1, -i\}$, where $i^2 = -1$.


Exercise 14.3b 


1 Write out the table for multiplication of the set {1, 3,5, 9, 11, 13} under 
multiplication mod 14, and show that it is a group. 


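2 Show that, if x is the permutation

$$x = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 \\ 4 & 1 & 5 & 2 & 3 \end{pmatrix}$$

and e is the identity permutation, then $e, x, x^2, x^3, x^4$ and $x^5$ form a group.

3 What is the order of the group generated by the permutation

$$y = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ 2 & 7 & 4 & 8 & 5 & 1 & 6 & 3 \end{pmatrix}?$$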
4 Show that {1, 4, 7, 13}, x 1, is a group, and find a group under Χ ,; 
which contains the number 3. 


5 Finda group under Χ 9,9, containing the number 8. 


6 Show that the functions $x \mapsto x$, $1 - x$, $1 - 1/x$, together with their
reciprocals, form a group under function composition. Verify that it is not
commutative.
7 What is the order of the symmetry group, taking both rotations and
reflections in all cases, of:

(i) a playing card (say Jack of Hearts);

(ii) an isosceles triangle;

(iii) a rhombus;

(iv) a pyramid on a regular pentagonal base;

(v) a regular hexagon;

(vi) a square.

8 Draw examples of plane figures whose symmetry groups have order 
3, 4, etc. 


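9 Show that the permutations

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 4 & 6 & 5 & 1 & 3 & 2 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 6 & 5 & 4 & 3 & 2 & 1 \end{pmatrix}$$

when combined in every possible way give rise to a group of order six.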


10 Take the matrices


1 O 0 1 
4=(( Bi and B= (‘ ᾽ 


and consider all possible matrices obtained by combining them by multi- 
plication (e.g., AB, BA, ABA, etc.). Show that these form a group of order 8. 


11 Show that the complex 12th roots of 1 form the group $C_{12}$ under
multiplication. Is it possible to obtain the group $C_{12}$ under addition of
complex numbers?


12 Obtain a set of permutations forming the group C,. Also obtain a set 
of permutations of seven objects forming the group C9. 


13. What are the periods of each element in the groups Cy, C,,? 


14 By writing each of the set {1, 2, 3, ..., 12} as a power of 2 (mod 13),
e.g., $12 \equiv 2^6$ (mod 13), show that this set forms the group $C_{12}$.


15 A rectangular cuboid has the following symmetries:

(i) half-turns about three mutually perpendicular axes (call these a, b, c);
(ii) reflections in three planes of symmetry. Call these p, q, r, where p is
the reflection in the plane containing the axes of b and c, etc.

If the identity is called e, write out the table showing the composition of
e, a, b, c and show that this is a group. What is its structure? Write out the
table showing the composition of the seven symmetries listed, showing that
a group is obtained (the ‘full group’ of the cuboid) only when an eighth
symmetry is included, and specify the nature of this eighth element.


16 In the set of ordered pairs of reals under the operation * defined by
the formula

$$(x_1, y_1) * (x_2, y_2) = (x_1 x_2,\; x_1 y_2 + y_1),$$

what is the identity, and what is the inverse of (x, y)? What must be done
to arrive at group structure?
14.4 Isomorphisms


Saying that Table 12 is the ‘same’ group as Table 10 and Table 11 is
rather like saying that 1/2 is the ‘same’ fraction as 2/4 and 3/6. It is more
correct to describe these fractions which are equal to 1/2 as belonging to an
‘equivalence class’ of fractions, of which it is convenient to take 1/2 as the
simplest representative. The tables for the groups discussed are the ‘same’
in the sense that they have identical patterns, as one can readily see by the
‘stripes’ running down from N.E. to S.W. The stripes are not apparent
in the case of Table 3 till a rearrangement is made by interchanging R and S:


Table 13 


The correspondence with Table 14 (below) for $\{1, i, -1, -i\}, \times$ is now
clear when the elements are paired off:


$$I \leftrightarrow 1; \quad R \leftrightarrow i; \quad H \leftrightarrow -1; \quad S \leftrightarrow -i.$$


Table 14 


This conforms with our knowledge that multiplication by $i$ is represented
geometrically by a quarter turn ($R$). However, there would be nothing
to stop us setting up the correspondence


$$I \leftrightarrow 1, \quad R \leftrightarrow -i, \quad H \leftrightarrow -1, \quad S \leftrightarrow i,$$


with R and S interchanged. The reader should do the same in the case of
the group whose Table 7 is shown on p. 497.

Groups which have identical structures are said to be isomorphic, and the
idea of isomorphism is one of the most important in mathematics. Indeed,
the fact that the same pattern may arise in a wide variety of situations is the
main justification for a close study of structure, as it means that techniques
devised for one situation may well be applicable in a seemingly different
context, because the two situations may, for all their apparent diversity,
really be examples of the same structure. The relation between two groups
of being isomorphic is usually denoted ‘$\cong$’. It will emerge later that this is
a so-called equivalence relation between groups, and that isomorphic groups
form an equivalence class, which may be called the abstract group of that
structure.


Groups of order 4 


We have already seen, however, that there are two quite distinct groups 
of order 4: 


and 


The first is the cyclic group $C_4$, and we have had several examples ranging
from the rotations of the figure on p. 494 to the set {1, 2, 3, 4} under
multiplication mod 5. Other examples of this group are to be found in
Tables 1, 3, 7, 10, 11, 12, 13 and 14.

The second, although so far provided with only two illustrations (p. 480
and Table 2), is no less important. This is known as the ‘Klein 4-Group’, and
contains three elements of period 2. Other representations of this group
which the reader may check are {1, 3, 5, 7}, × mod 8, and the combination
of the four functions x, −x, 1/x, −1/x. For representations of the Klein
4-Group in a large number of contexts, see Budden, F. J., The Fascination of
Groups, C.U.P., 1972, pp. 131-8.

The above two are, in fact, the only possible distinct groups of order 4.
As for groups of higher orders, it is easy to show that there is only one
possible group of order n whenever n is prime; whilst for composite values
of n,


n = 6:   2 groups (including one non-Abelian)
n = 8:   5 groups (two non-Abelian)
n = 9:   2 groups
n = 10:  2 groups (one non-Abelian)
n = 12:  5 groups (three non-Abelian)
Groups of order 6


Consider the three sets of permutations: 


        (i)                      (ii)                   (iii)


 1   | 1 2 3 4 5 6        1   | 1 2 3 4 5        1 | 1 2 3
 x   | 2 3 4 5 6 1        r   | 3 1 2 4 5        p | 3 1 2
 x^2 | 3 4 5 6 1 2        r^2 | 2 3 1 4 5        q | 2 3 1
 x^3 | 4 5 6 1 2 3        d   | 1 2 3 5 4        a | 1 3 2
 x^4 | 5 6 1 2 3 4        y   | 3 1 2 5 4        b | 3 2 1
 x^5 | 6 1 2 3 4 5        z   | 2 3 1 5 4        c | 2 1 3

It may easily be verified that each of these sets is closed, and that in each 
case we get a group of order 6: 


Table 15 


(It should be noted that the permutation which is performed first is shown 
across the top of the table, the second permutation being shown down the 
side, so that in Table (ii), for example, ac = p signifies that permutation c 
is performed first, so that the a is read off on the left and the c at the
top). We shall now show that the first two are isomorphic, but that their
structure is different from that of the third group.

First note that x may be denoted in cycle notation as (1 2 3 4 5 6),
and clearly $x^6 = 1$, whereas $x^n \neq 1$ for n = 1, 2, 3, 4, 5. We say that the
permutation x has period 6 or order 6, meaning that 6 repetitions of the
permutation are needed in order to get back to the identity. The 6
permutations of the group are thus $\{1, x, x^2, x^3, x^4, x^5\}$, and the group
is $C_6$.

Looking at the second set of permutations we see that r and $r^2$ are both
of period 3, that d is of period 2, but when we come to y (= dr = rd), we
have a permutation of period 6. This encourages us to begin to set up a
correspondence between the first two groups by matching x with y. Indeed,
it turns out that $y^2 = r^2$, $y^3 = d$, $y^4 = r$, $y^5 = z$, so that we have the cyclic
group $\{1, y, y^2, y^3, y^4, y^5\}$ governed by $y^6 = 1$. The tables for the first two
groups would thus correspond exactly; for wherever x occurs in the first,
we should have y in the second. This same group would arise in other
situations, in fact whenever an equation of the type $x^6 = 1$ appears, e.g.,
if $\omega$ = cis 60°, $\omega^6 = 1$; or if R were a rotation through 60°, we should
have $R^6 = I$ (the identity); if M is the matrix of such a rotation, then
$M^6 = I$; again, since $3^6 \equiv 1$ (mod 14), we have the group $C_6$ appearing as a
multiplicative group in a finite arithmetic.

The group of the third set of permutations is different from the first two
in the sense that, while we can easily enough set up a one-to-one
correspondence, we cannot do so in such a way as to preserve products. That
this must be so is immediately apparent from the fact that the third group
contains no permutation of period 6, the periods of permutations being
respectively 1, 3, 3, 2, 2, 2. Another easy way of seeing that the third group
is different is that, unlike the first two, it is not Abelian; for if b = (1 3),
c = (1 2), then remembering that bc means ‘first c, then b’, we have
bc = (1 2 3) = q, whereas cb = (1 3 2) = p. This non-Abelian group of order 6
will be discussed in more detail on p. 509 when we consider the symmetries
of the equilateral triangle.
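
The periods quoted above, and the failure of commutativity, can be confirmed by direct computation. In this sketch each permutation is stored as the tuple of images of 1, 2, 3, ... (its bottom row in two-row notation); the helper names `compose` and `period` are hypothetical, and `compose(s, t)` follows the convention of the text that the right-hand permutation is performed first.

```python
def compose(s, t):
    """Return the product st, meaning 'first t, then s'."""
    return tuple(s[t[i] - 1] for i in range(len(t)))

def period(s):
    """Smallest n >= 1 for which s^n is the identity."""
    identity = tuple(range(1, len(s) + 1))
    n, power = 1, s
    while power != identity:
        power = compose(s, power)
        n += 1
    return n

# Bottom rows of the third set of permutations.
third = {
    "1": (1, 2, 3), "p": (3, 1, 2), "q": (2, 3, 1),
    "a": (1, 3, 2), "b": (3, 2, 1), "c": (2, 1, 3),
}

print([period(s) for s in third.values()])   # [1, 3, 3, 2, 2, 2]
print(period((2, 3, 4, 5, 6, 1)))            # x has period 6
print(period((3, 1, 2, 5, 4)))               # y has period 6

b, c = third["b"], third["c"]
print(compose(b, c) == third["q"], compose(c, b) == third["p"])  # True True
```

Since an isomorphism must match elements of equal period, the absence of any element of period 6 in the third list is already enough to separate it from the other two groups.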


Isomorphism of infinite groups 


Now it is all very well saying that two groups are isomorphic if their tables
may be rearranged to correspond, but this is only meaningful in the case of
finite groups. How may one describe isomorphism in the case of infinite
groups? To answer this, we must give a more precise definition for two
groups $G$, $G'$ with different operations $\circ$ and $*$: the groups $G, \circ$ and $G', *$
are said to be isomorphic provided there is a one-to-one correspondence
between the elements


$$x, y, \ldots \in G \quad \text{and} \quad x', y', \ldots \in G'$$


such that for all $x, y \in G$ and $x', y' \in G'$, it is also true that $x \circ y$
corresponds to $x' * y'$.
Now take the group $R^+, \times$ of positive reals under multiplication and
the group $R, +$ of all reals under addition.

These can easily be shown to be isomorphic by the simple process of
mapping the first set onto the second by the logarithm function $x \mapsto \ln x$.
Then if $z = xy$ in the first group, we have $\ln z = \ln x + \ln y$, so that
$z' = x' + y'$ in the second group and the structural properties of the first
group are reproduced in the second (where the operation is, of course,
addition).

Again, consider the groups $Z, +$; $10Z, +$; and also the geometrical
group generated by a translation $t$.


[Figure: a pattern generated by the translation t]


We may show that these three groups are in fact isomorphic by setting
up a correspondence where $x$ in $Z, +$ corresponds with $10x$ in $10Z, +$ and
to $t^x$ in the group of translations. Then ‘products’ (as we shall continue to
call them, though the operation is addition) are preserved.


For example,           −5 + 2 = −3             in the group Z, +
becomes                −50 + 20 = −30          in the second group,
and                    t^(−5) × t^2 = t^(−3)   in the case of the third group.
Or, more generally,    x + y = z               in Z, +
corresponds to         10x + 10y = 10z         in 10Z, +,
and to                 t^x t^y = t^z           in the third group.
Consider next the operation on the integers given by the formula

$$m * n = m + (-1)^m n.$$

This is clearly non-commutative, but is in fact associative. Closure is
self-evident, while 0 is a left and right identity. Finally, the inverse of an
even integer is its negative, e.g.,

$$18 * (-18) = 18 + (-1)^{18}(-18) = 0,$$
$$(-18) * 18 = -18 + (-1)^{-18}\,18 = 0,$$

but the inverse of an odd integer is itself, e.g.,

$$19 * 19 = 19 + (-1)^{19}\,19 = 0,$$
$$(-19) * (-19) = -19 + (-1)^{-19}(-19) = 0.$$

Thus $Z, *$ is a group.
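
The claims made for this operation can be tested over a finite range of integers. The sketch below is illustrative only; the names `star` and `inverse` are hypothetical, the sign $(-1)^m$ is obtained from the parity of m so that negative m causes no difficulty, and a check over a finite window naturally falls short of a proof.

```python
def star(m, n):
    """The operation m * n = m + (-1)^m n on the integers."""
    return m + n if m % 2 == 0 else m - n

def inverse(m):
    """An even integer is inverted by its negative, an odd one by itself."""
    return -m if m % 2 == 0 else m

window = range(-6, 7)

associative = all(star(star(m, n), k) == star(m, star(n, k))
                  for m in window for n in window for k in window)
identity_ok = all(star(0, m) == m and star(m, 0) == m for m in window)
inverses_ok = all(star(m, inverse(m)) == 0 and star(inverse(m), m) == 0
                  for m in window)

print(associative, identity_ok, inverses_ok)  # True True True
print(star(2, 3), star(3, 2))                 # 5 1, so the operation is not commutative
```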


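For a second group we take the series of images set up by reflection in
two parallel mirrors.


[Figure: a bird motif and its images under reflection in two parallel mirrors, the a axis and the b axis; each image is labelled with a word such as ab or ba, and with an integer ..., −2, −1, 0, 1, ... in the bottom row]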


In the figure, a represents the operation of reflecting in the first mirror,
b represents a reflection in the second mirror, and following the usual
convention, baba (for example) represents the result of two pairs of alternate
reflections in the two mirrors, starting with the first. In the figure, each
image of the bird motif is labelled according to the reflections leading to it.
These two reflections generate an infinite group, the elements of which are
every possible ‘word’ formed from the two letters a and b, e.g., abababababa,
bababa, etc. (since $a^2 = b^2 = 1$, there will be no powers of a and b higher
than 1).

We now show that these two infinite non-Abelian groups are isomorphic.
The method of setting up the correspondence between $Z, *$ and
the group generated by reflections a and b is that shown in the figure
whereby each image is labelled not only with the ‘word’ describing the
reflections, but is also allotted an integer in the bottom row. The way the
correspondence works out is that the number in the bottom row, regardless
of sign, says how many letters there are in the ‘word’, e.g., −3 for aba;
+6 for bababa; while the sign is + if the initial letter of the word is b, − if
it is a. The way in which the products are ‘imitated’ between the two
groups is illustrated in three cases:


Z, *              Reflections
5 * 2 = 3         (babab)(ba) = babab²a = bab
−5 * 3 = −8       (ababa)(bab) = abababab
6 * −3 = 3        (bababa)(aba) = bababa²ba = bab   (using a² = b² = 1).


Thus the two groups are isomorphic, and in giving this example we have 
incidentally illustrated how groups may arise in the symmetries of an 
infinite geometrical pattern, here a one-dimensional ‘frieze’ pattern formed 
by the bird motifs. 
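
The correspondence between integers and words can be tried out over a range of cases. The following sketch is an illustration under stated assumptions: the function names `star`, `word` and `reduce_word` are hypothetical, words are stored as Python strings, and reduction simply cancels adjacent equal letters (using $a^2 = b^2 = 1$).

```python
def star(m, n):
    """The operation m * n = m + (-1)^m n on the integers."""
    return m + n if m % 2 == 0 else m - n

def word(n):
    """Alternating word with |n| letters, starting with b if n > 0 and a if n < 0."""
    letters = "ba" if n > 0 else "ab"
    return "".join(letters[i % 2] for i in range(abs(n)))

def reduce_word(w):
    """Cancel adjacent equal letters, since aa = bb = 1."""
    stack = []
    for ch in w:
        if stack and stack[-1] == ch:
            stack.pop()
        else:
            stack.append(ch)
    return "".join(stack)

print(word(5), word(2), reduce_word(word(5) + word(2)))  # babab ba bab

window = range(-8, 9)
print(all(word(star(m, n)) == reduce_word(word(m) + word(n))
          for m in window for n in window))              # True
```

The final line checks that the word belonging to m * n is always the reduced product of the words belonging to m and n, which is exactly the requirement that products be preserved.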
Exercise 14.4a


1 Show that $\{1, i, -1, -i\}, \times$ is a group isomorphic to the group
{1, 3, 7, 9}, × mod 20, and to the group of rotations of the figure on p. 494,
and also to the permutations $\{1, x, x^2, x^3\}$ where


$$x = \begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 \\ 2 & 1 & 4 & 5 & 6 & 3 \end{pmatrix}.$$


2 Invent other examples of the Klein four group other than those in the
text:

(i) as a finite arithmetic under multiplication;

(ii) as a set of permutations (of four objects);

(iii) as a set of four 2 × 2 matrices;

(iv) as a symmetry group of a plane figure other than the rectangle.


3 Why is it not possible to set up an isomorphism between

(i) the symmetries of the equilateral triangle and the rotations of a
regular hexagon;

(ii) the symmetries of the ellipse and {1, 3, 7, 9}, × mod 10;

(iii) the complex numbers under addition and the vectors in
three-dimensional space under addition;

(iv) the positive reals under multiplication, and the rationals under
addition.


4 If a group is isomorphic to a proper subgroup of itself, what must
necessarily be true?


5 Show that $\{2^n : n \in Z\}, \times$ is isomorphic to the set of translations in a
line generated by a single given translation.


6 Let S be the set $\{x : 0 \le x < 1,\ x \in R\}$. Show that S with the operation
addition mod 1 (e.g., 0.7 + 0.84 = 0.54), is a group, and that this is
isomorphic to the group of rotations of the circle.


7 Establish the isomorphism between the groups shown in these two
tables:
tables: 


8 In the group $C_9$, $\{0, 1, 2, \ldots, 8\}, +_9$, which elements are generators?
Which are generators in similar representations of $C_8$, $C_{10}$, $C_{12}$? If every
element of the cyclic group $C_n$ is a generator (except the identity), what can
you say?


9 Consider the group $\{1, p, p^2, p^3\}$ with $p^4 = 1$. Show that the permutation
of the elements which interchanges $p$ and $p^3$ preserves the structure, but no
other permutation of the four elements has this property. (Such a
permutation is known as an ‘automorphism’, in this case of $C_4$.)


10 Find the automorphisms of the group $\{1, a, b, c\}$ with $a^2 = b^2 = 1$,
$ab = ba = c$.


11 Show that the correspondence $x \leftrightarrow -x$ is an automorphism of $Z, +$.


12 Show that $\{1, \alpha, \alpha^2, \alpha^3, \alpha^4, \alpha^5, \alpha^6\}, \times$ where $\alpha^7 = 1$ has six
automorphisms, one of which is $\alpha \to \alpha^4$, and find the others. Find the
automorphisms of the group $C_8$ formed by the eighth complex roots of 1.


The group of the equilateral triangle, $D_3$


So far the only non-Abelian finite group has been the one of order 6 con- 
sisting of every possible permutation of 3 digits. This is the smallest case of a 
non-Abelian group, and we now look at it in more detail. In setting up the 
table showing the products, we must (since many of the products are non- 
commutative) specify that when we read the product xy from row x and 
column y, this is to be the value of xy; so y, the first operation, is shown across 
the top of the table, and the second operation is down the left-hand side. 
The table showing the composition of the permutations will be found to be 


Table 16 


Ist operation 


2nd operation 


To complete the table, it is not necessary to find every single product from
the permutations as we did on p. 505. For, once having obtained bc = q,
we may immediately deduce (for example):

$bq = bbc = b^2 c = c$, since $b^2 = 1$;

$qc = bcc = bc^2 = b$, since $c^2 = 1$;

$cp = bqp = b1 = b$, since $qp = 1$, and so on.
We next show that the same group arises as a group of symmetries.
Consider an equilateral triangle 1 2 3 which is subjected to the following
rotations:


[Figure: the triangle with vertices 1, 2, 3 before and after the rotation p = (1 3 2)]

It will be seen that p is an anti-clockwise or a clockwise rotation according
to whether the triangle has or has not been turned over (is showing ‘heads’
or ‘tails’); q is simply the inverse of this, so $q = p^{-1} = p^2$; whilst a is a
rotation about the median through 1, so interchanges the vertices 2 and 3.
Similarly, b = (1 3) and c = (1 2). These five rotations together with the
identity constitute the symmetry group of the equilateral triangle. It is
clear, not only that every rotation has been matched with a corresponding
permutation of the vertices, but also that the rotations will combine in a
way which exactly mimics the composition of the permutations. So we have
an isomorphism. For example, corresponding to the equation pc = a
expressing the composition of the permutations (1 3 2)(1 2) = (2 3),
we observe that if the triangle is subjected first to a rotation about the
median through 3 (thereby turning it over) and then to a (now clockwise)
120° turn p, we obtain the final position which would arise from a single
rotation about the median through 1, i.e., a = (2 3).


Note that
(i) a, b, c were described as rotations, taking the triangle into three
dimensions, but could equally well have been regarded as reflections; and
(ii) the triangle could just as well have been a uniform triangular prism,
the rotations a, b, c being about symmetry axes situated midway between
the triangular faces.

In the previous paragraph we have been considering the group of
symmetries of the equilateral triangle which includes not only the plane
rotational symmetries through 0, 120°, 240° (1, p, q) which would be
possessed by a figure with rotational symmetry alone [figure: triangular
motif], but also the bilateral symmetries about its three medians. Similarly,
if we look at a square instead of an equilateral triangle, it would possess not
only the rotational symmetries, as in Table 3, but also the bilateral
symmetries about four axes,


[Figure: a square with its four symmetry axes labelled a, b, c, d]


and these will together give a group of order 8, the symmetry group of the
square. The full group of rotations and reflections of a regular n-sided
polygon is called the dihedral group of order 2n, and is denoted $D_n$. A useful
exercise for the reader would be to construct the group table for $D_4$, using
the notation of this figure where (for a change) we use a, b, c, d to be fixed
symmetry axes (unlike the axes which moved with the triangle on p. 510)
while r is an anti-clockwise quarter-turn. The table should be set out with
the elements in the order $1, r, r^2, r^3, a, b, c, d$.


Infinite symmetry groups 


Usually the symmetry group of a geometrical figure is finite, but we have
already met (p. 507) one case of an infinite group, when the geometrical
figure itself extended to infinity. The groups arising from repeating patterns
are of great interest, but there is not space to deal with them here, and the
reader is referred to Budden, F. J., The Fascination of Groups, C.U.P., 1972,
Chapter 26. Another case when we obtain an infinite symmetry group is
that of the circle, which is not only symmetrical by rotation through any
angle, but also possesses bilateral symmetry about any diameter.


Symmetry groups of solids: the tetrahedral group 


We have already mentioned the symmetry groups of the cuboid (Table 2),
and more recently the six rotational symmetries of the equilateral
triangular prism. When dealing with the symmetries of three-dimensional
geometrical figures, one has to specify whether one is concerned with
rotations only (as above), or whether one is going to include reflective, or
‘opposite’ symmetries, in which case the cuboid would have a group of
order 8, and the triangular prism one of order 12. A good illustration of this
is provided by the regular tetrahedron, which has eight 120° turn
symmetries about its four altitudes, but also three half-turns about its joins
of mid-points of opposite edges which, with the identity, make a group of
order 12. If the vertices were labelled 1, 2, 3, 4, the 120° turns would
correspond to such permutations as (1 2 3), while the half-turns correspond
to (1 2)(3 4), etc. When reflections are included we should get all the other
12 permutations of the four vertices, such as (1 2), (1 2 3 4) etc., which
cannot be obtained as a result of direct geometrical movements, but only
with the aid of mirrors! The group of all 24 (= 4!) permutations of 4 digits
is called the symmetric group, $S_4$. More generally, the group of permutations
of n symbols is the symmetric group $S_n$, and is of order n!. On p. 509, we
saw that the group of the equilateral triangle was in fact isomorphic to $S_3$,
so we may write $D_3 \cong S_3$. As an exercise, the reader should construct the
group of order 12 of the rotations of the regular tetrahedron. This should
be done using the permutations p = (1 2 3) and a = (1 2)(3 4)
described above, and it will be found that the whole group is generated by
using these two permutations only, i.e., by forming all possible ‘words’
from a and p. The group is known as the tetrahedral group, or the alternating
group of order 12.
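
The statement that p and a generate the whole group of order 12 can be checked by machine before the table is constructed by hand. In this sketch permutations are stored as tuples of images (bottom rows), the helper names `compose`, `generate` and `is_even` are hypothetical, and products follow the convention that the right-hand permutation is performed first.

```python
from itertools import combinations

def compose(s, t):
    """Return the product st, meaning 'first t, then s'."""
    return tuple(s[t[i] - 1] for i in range(len(t)))

def generate(generators):
    """All permutations obtainable as words in the given generators."""
    identity = tuple(range(1, len(generators[0]) + 1))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in generators:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def is_even(s):
    """A permutation is even when it has an even number of inversions."""
    inversions = sum(1 for i, j in combinations(range(len(s)), 2) if s[i] > s[j])
    return inversions % 2 == 0

p = (2, 3, 1, 4)   # the cycle (1 2 3)
a = (2, 1, 4, 3)   # the pair of transpositions (1 2)(3 4)

tetrahedral = generate([p, a])
print(len(tetrahedral))                       # 12
print(all(is_even(s) for s in tetrahedral))   # True
print((2, 1, 3, 4) in tetrahedral)            # False: (1 2) needs a mirror
```

Only left multiplication by the generators is used; in a finite group this is sufficient, because the inverse of each generator is one of its own positive powers.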


Exercise 14.4b 


1 What are the symmetry groups of the octagons:


[Figures (i) to (vi): six octagons]


2 Draw hexagons whose symmetry groups are C3, C,, Do, D3. 


3 Show that two mirrors at an angle of 90° will produce a pattern with the
symmetry group $D_2$; and that if the angle were 60°, the group would be
$D_3$. Generalise.


4 Show that the matrices 


01 —] 0 
«-( 7 and B=( 0 ; 


generate a group of order 8, and that this group is isomorphic with the 
group of symmetries of the square. 


5 If f(x) = 2 − x and g(x) = 2/x, find gf(x) and fg(x), and the other
functions generated by f and g. What group do they form?


6 Note that the group of the equilateral triangle has the following
relations connecting the elements:


$$ap = p^2 a; \quad ap^2 = pa; \quad pap = a; \quad (ab)^3 = 1; \quad aba = bab,$$

and interpret these in geometric terms. Find other relations connecting:
(i) b and p; (ii) b and c.

Show that the relations $a^2 = p^3 = 1$, $ap = p^2 a$ are sufficient to determine
the group completely, and find another set of ‘defining relations’
based on the ‘generators’ a and b.


7 For a regular hexagon, let r denote a 60° rotation about its centre, and
a, b, c, d, f, g be reflections in the six lines of symmetry. Show that a = rar,
and obtain b, c, d, f, g each in terms of the generators a and r, it being
supposed that the six axes are labelled consecutively in the direction of the
rotation r. Draw up the group table of the twelve symmetries of the regular
hexagon ($D_6$).


8 Discuss various tetrahedra with rotation groups smaller than the group 
of the regular tetrahedron (e.g., a tetrahedron with one face equilateral 
and the other three congruent isosceles triangles has the group C3). 


9 Consider the rotations of the cube and classify them according to their 
period (e.g., a rotation through 90° has period 4). 


10 Show that the regular octahedron has symmetry group isomorphic 
to that of the cube. How many of the symmetries are rotations, and how 
many are opposite? 


11 Investigate (infinite) symmetry groups generated by:
(i) a single glide reflection;

(ii) reflections in two parallel mirrors;

(iii) reflections in two mirrors at various angles;

(iv) a half-turn and a translation;

(v) a half-turn and a mirror reflection.

Invent other examples.


Epilogue 


How many numbers are there? 


We have come a long way since first learning to count and now retrace our 
steps to this apparently childish question. By now, however, we are aware 
of a variety of sets of numbers. In arithmetic modulo 2, for instance, there
are just two elements, 0 and 1, whilst in the system of complex numbers
there is a far richer variety including such diverse numbers as


0, 1, —3, 44, a2, 3 — 1, ete. 
In this epilogue we shall confine ourselves to the real numbers R and
shall start by looking again at their subset of positive integers, $Z^+$.


The positive integers, $Z^+$


[Number line: 0, 1, 2, 3, 4, 5, ...]


A child very soon realises that there is no end to the sequence of natural
numbers, or positive integers. Even so, they are capable of being counted,
or set in order, because this is precisely what is done when we say ‘one, two,
three, four, five, ...’: we name them in sequence, so that (given life
and breath) any given number would ultimately be reached. We therefore
say that the positive integers $Z^+$ are countable.


The integers, Z 


[Number line: ..., −3, −2, −1, 0, 1, 2, 3, ...]


When, however, we consider all the integers (positive, negative and zero)
an immediate difference is apparent, that there is no obvious starting-point
for such an operation of counting. Even so, they can clearly be set down in
the order


0, 1, −1, 2, −2, 3, −3, ...


Here again, every integer (positive, zero or negative, and however
large) has its place in this sequence, which therefore provides us with a
means of threading our way through them. So, just as for natural numbers,
the set Z of all integers is countable.


The rational numbers, Q 


There are, of course, many gaps between the integers and into these gaps 
we can place other quotients of integers like 7, 3, —§, so obtaining the set Q 
of rational numbers: 


It is clear that there is no end to the number of such rationals that can be
placed between two integers. The interval between 0 and 1, for instance,
could be divided into tenths, hundredths, thousandths or millionths, and
each of the points of subdivision would correspond to a rational number.
So the number of rationals between 0 and 1 is clearly infinite.

This is, of course, just as true of the number of rationals that can be
placed between any two given rationals, however close they are. For if
p/q and r/s are the given rational numbers (p, q, r, s being integers), their
mean is


$$\frac{1}{2}\left(\frac{p}{q} + \frac{r}{s}\right) = \frac{ps + qr}{2qs}.$$

Now ps + qr and 2qs are integers, so this mean is itself a rational number
and we see that between any two rational numbers there is always another,
and, therefore, by continued bisection, an infinity.
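
The process of continued bisection can be imitated with exact fractions. This is a small sketch only; the function names `mean` and `between` are hypothetical, and the fractions 1/3 and 1/2 are arbitrary sample values.

```python
from fractions import Fraction

def mean(x, y):
    """The mean of two rationals, which is again rational."""
    return (x + y) / 2

def between(x, y, count):
    """Return `count` distinct rationals strictly between x and y (x != y)."""
    found, end = [], y
    for _ in range(count):
        end = mean(x, end)   # bisect the interval that still contains x
        found.append(end)
    return found

x, y = Fraction(1, 3), Fraction(1, 2)
print(mean(x, y))        # 5/12, i.e. (ps + qr)/(2qs) with p/q = 1/3, r/s = 1/2
print(between(x, y, 3))  # 5/12, 3/8 and 17/48
```

However many values are requested, the process never runs out, since each new mean lies strictly between x and the previous one.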

There is, therefore, certainly no shortage of rational numbers and the 
former question again arises. Is it still possible to count our way through 
this multitude of rationals, or are they too numerous? Can we construct a 
sequence in which every rational number has its place? 

This question was first raised in 1873 by the German mathematician
Georg Cantor and did not prove difficult for him to answer. Rather, as we
shall see, did his genius lie in asking the question.
Let us start by considering the positive rationals, together with the
number 0. First we construct the pattern


0/1
1/1
1/2  2/1
1/3  2/2  3/1
1/4  2/3  3/2  4/1
1/5  2/4  3/3  4/2  5/1


and delete any number which is not in its lowest terms. Now if this pattern
is continued indefinitely, every positive rational number will have its
place. We notice, for instance, that the fifth row consists of all positive
rationals whose numerator and denominator add up to five and the sixth
row consists of those whose numerator and denominator (when expressed
in their lowest terms) add up to six. Similarly a rational number such as
7/13 would occur in the 20th row; and, more generally, the number p/q (if
expressed in its lowest terms) in the (p + q)th row.

So, although the positive (and zero) rationals are extremely numerous, 
we have invented a way of counting them: 


0, 1, 1/2, 2, 1/3, 3, 1/4, 2/3, 3/2, 4, 1/5, 5, ...
Now we can just as easily count all the rationals (positive, zero and negative) 
simply by placing each negative rational immediately after the corresponding 
positive rational: 


0; +1; -1; +1/2; -1/2; +2; -2; +1/3; -1/3; ...


Hence the set Q of all rationals is countable. 
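
The counting scheme can be written out as a short program. This is a sketch under stated assumptions: row n is taken to hold the fractions p/q with p + q = n, listed with increasing numerator, and the generator names `nonnegative_rationals` and `all_rationals` are invented for illustration.

```python
from fractions import Fraction
from itertools import count, islice
from math import gcd

def nonnegative_rationals():
    """Yield 0 and every positive rational exactly once, row by row.
    Row n holds the fractions p/q with p + q = n."""
    for row in count(1):
        for p in range(row):          # numerators 0, 1, ..., row - 1
            q = row - p
            if gcd(p, q) == 1:        # delete entries not in lowest terms
                yield Fraction(p, q)

def all_rationals():
    """Place each negative rational immediately after its positive partner."""
    for r in nonnegative_rationals():
        yield r
        if r != 0:
            yield -r

first = list(islice(all_rationals(), 9))
print(first)  # 0, 1, -1, 1/2, -1/2, 2, -2, 1/3, -1/3
```

Because a fraction p/q in lowest terms sits in row p + q, and every row is finite, each rational is reached after finitely many steps, which is exactly what countability requires.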


The continuum of real numbers, R 


[Figure: the real number line, marked at the integers from -3 to 3]


There is such a multitude of rational numbers, with an infinity of them 
inside any interval, that the question naturally arises whether there is 
room for any other kind of number. Now we have seen that Euclid had 
effectively answered this question about 300 BC when he showed that $\sqrt{2}$ is 
not a rational number and we naturally ask whether such numbers are 
rare or common. Are most numbers rational or irrational? Certainly we 
have seen that the rational numbers are countable. Can we similarly 
count all the numbers on the straight line, rational and irrational? 
This question too was first asked by Cantor and at the beginning of December 
in 1873 (just a hundred years ago as this epilogue is being written) 
he answered it by proving one of the most famous results of mathematics, 
that of the non-enumerability of the continuum: the real numbers R, rational and 
irrational, are too numerous to count. 

Let us consider the numbers between 0 and 1 and suppose that they can 
be arranged in a sequence 


\[ a_1, a_2, a_3, a_4, \ldots, \]
which includes every such number. 

Now each of these numbers can be expressed as a decimal. It is true that 
there is a slight difficulty about decimals which terminate as they can be 
represented either with a recurring 0 or a recurring 9: 3/20, for instance, can 
be written 
either as 0.150 000... 
or as 0.149 999.... 
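
That these two decimals really do name the same number can be checked by summing the recurring 9s as a geometric series (a short worked step, not part of the original argument):

```latex
\[
0.149\,999\ldots
  = 0.14 + 9\sum_{k=3}^{\infty} 10^{-k}
  = 0.14 + 9 \cdot \frac{10^{-3}}{1 - 10^{-1}}
  = 0.14 + 0.01
  = 0.15 = \frac{3}{20}.
\]
```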


But in such cases we simply stipulate that a recurring 9 is never to be used. 
So each number can be represented uniquely in decimal form, and we can 
write 

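\[
\begin{aligned}
a_1 &= 0.\alpha_{11}\,\alpha_{12}\,\alpha_{13}\,\alpha_{14}\ldots \\
a_2 &= 0.\alpha_{21}\,\alpha_{22}\,\alpha_{23}\,\alpha_{24}\ldots \\
a_3 &= 0.\alpha_{31}\,\alpha_{32}\,\alpha_{33}\,\alpha_{34}\ldots \\
a_4 &= 0.\alpha_{41}\,\alpha_{42}\,\alpha_{43}\,\alpha_{44}\ldots \\
    &\;\;\vdots
\end{aligned}
\]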


as a sequence containing every number between 0 and 1. 
Having done this, we now attempt to construct another number 

\[ b = 0.\beta_1\,\beta_2\,\beta_3\,\beta_4 \ldots \]

which is different from every a. Deciding to be as perverse as we possibly 
can, we see that this could be achieved if b were 

different from $a_1$ in its first decimal place, 
different from $a_2$ in its second decimal place, 
different from $a_3$ in its third decimal place, and so on. 

We therefore choose $\beta_1, \beta_2, \beta_3, \ldots$ by the following rule: 

if $\alpha_{nn} \neq 5$, choose $\beta_n = 5$, 
and if $\alpha_{nn} = 5$, choose $\beta_n = 4$. 

Then $\beta_n \neq \alpha_{nn}$ for all values of n. 
Hence $b = 0.\beta_1\,\beta_2\,\beta_3 \ldots$ 

is different from every single member of the sequence $a_1, a_2, a_3, \ldots$. By 
this means, therefore, we have succeeded in constructing a number between 
0 and 1 which is not a member of this sequence. Hence our initial assumption 
is contradicted, and we see that it is not possible to count even the real 
numbers between 0 and 1. It is, therefore, certainly impossible to count all 
the real numbers R, so that the continuum is not enumerable. 
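
The rule for choosing the digits of b is easy to mechanise. The sketch below applies it to a finite list only, which can illustrate the construction but of course proves nothing about an infinite sequence; the function name `diagonal_number` and the sample digit strings are invented for the purpose.

```python
def diagonal_number(decimals):
    """Given digit strings (the digits after '0.'), return the digits of a
    number b that differs from the n-th string in its n-th decimal place."""
    digits = []
    for n, expansion in enumerate(decimals):
        # Rule from the text: use 5 unless the diagonal digit is 5, then use 4.
        digits.append("4" if expansion[n] == "5" else "5")
    return "".join(digits)

listed = [
    "1415926",
    "5555555",
    "7182818",
    "0005000",
    "3333533",
    "1234567",
    "9999995",
]
b = diagonal_number(listed)
print("0." + b)  # 0.5454454
```

Since b uses only the digits 4 and 5, it never ends in a recurring 0 or a recurring 9, so the convention about terminating decimals causes no difficulty.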

This, of course, provides us with another proof, over two thousand years 
later than that of Euclid, that there are other numbers besides the rationals. 
For the rational numbers were countable, but the continuum is certainly 
not countable, so must be far more numerous; and whereas Euclid’s proof 
produces a single irrational number, Cantor produced an uncountable 
infinity all at once. 


Transcendental numbers 


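It is now interesting to speculate about the numbers which constitute this 
uncountable multitude. Which are they? A first guess would be that they 
are irrational numbers like $\sqrt{2}$ and the other roots of rational numbers. But this 
is mistaken. For the positive rational numbers $Q^+$, as they are countable, can be written 

\[
\begin{array}{ll}
 & r_1,\ r_2,\ r_3,\ r_4,\ \ldots \\
\text{so their square roots are} & \sqrt{r_1},\ \sqrt{r_2},\ \sqrt{r_3},\ \sqrt{r_4},\ \ldots \\
\text{their cube roots are} & \sqrt[3]{r_1},\ \sqrt[3]{r_2},\ \sqrt[3]{r_3},\ \sqrt[3]{r_4},\ \ldots \\
\text{their fourth roots are} & \sqrt[4]{r_1},\ \sqrt[4]{r_2},\ \sqrt[4]{r_3},\ \sqrt[4]{r_4},\ \ldots
\end{array}
\]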



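Now we can count through this set of numbers by this route: 

\[ r_1;\quad r_2,\ \sqrt{r_1};\quad r_3,\ \sqrt{r_2},\ \sqrt[3]{r_1};\quad r_4,\ \sqrt{r_3},\ \sqrt[3]{r_2},\ \sqrt[4]{r_1};\quad \ldots \]

and so the set of all positive rational numbers and all their roots is countable. 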
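
One way to see that such a route misses nothing is to label each entry by a pair (k, i), meaning "the k-th root of the i-th rational", and to walk along the diagonals on which k + i is constant. This is a sketch assuming a diagonal route; the function name `root_labels` and the pair labelling are illustrative, with k = 1 standing for the rational itself.

```python
from itertools import count, islice

def root_labels():
    """Yield pairs (k, i), read as 'the k-th root of r_i', along successive
    diagonals k + i = 2, 3, 4, ... of the infinite array."""
    for total in count(2):
        for k in range(1, total):
            yield (k, total - k)

first_six = list(islice(root_labels(), 6))
print(first_six)  # [(1, 1), (1, 2), (2, 1), (1, 3), (2, 2), (3, 1)]
```

Every diagonal is finite, so any given entry (k, i) is reached after at most (k + i)(k + i - 1)/2 steps.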

It can similarly be shown that the roots of all polynomial equations with 
rational coefficients (known as the set of algebraic numbers) are also 
countable. So it is clear that there must exist a vast uncountable 
multitude of numbers which are not the roots of any such 
equation. These numbers, which are not algebraic, are called transcendental. 

Which numbers are transcendental? Certainly it is clear that no rational 
number or root of a rational number is transcendental, and we might well 
wonder what comprises this vast galaxy. Now by coincidence it was also 
proved in 1873 that e cannot be the root of any algebraic equation with 
rational coefficients, so must be just such a transcendental number; and in 
1882 it was shown that $\pi$ is another. Even today, however, we know very 
few kinds of naturally occurring transcendental numbers: $\ln 2$, $e^{\pi}$, $2^{\sqrt{2}}$, $\sin 1$ 
have been proved transcendental, but they are rarities. 

What is known is simply that when all rationals, together with all roots 
of all rationals, and together with all the roots of all algebraic equations 
with rational coefficients, have been removed from the continuum, it has 
hardly been diminished, and that the infinitely vaster multitude of trans- 
cendental numbers has remained secretly in hiding. And in this, it might 
be said, these numbers are typical of the whole of mathematics, where 
much is known but much more is not. 


Answers to exercises 


Exercise 9.1a 


1 (i) 0.69 (ii) 1.10 (iii) 1.79 (iv) 2.20 
(v) 2.48 (vi) -0.69 (vii) -1.10 


Exercise 9.1b 


1 (i) 1.386 30 (ii) 1.791 76 (iii) 2.302 59 (iv) 2.484 91 
(v) -0.693 15 (vi) -1.791 76 (vii) -2.484 91 
(viii) 13.815 54 (ix) 0.346 57(5) (x) 1.535 06 

(i) 3 (ii) 0 (iii) -1 (iv) 0.5 
( 


i) e? (ii) €1° (Δ) 1%/e (ὦ) 5 

᾿ς χ i) x (11) " (10) ; (v) ee (v2 ) ὃς (vit ) ᾿Ξ 
᾿ Ι “-- ΧΆ 

(υἱ1)} e401) (1x) ae (x) ara py (xt) 1 + Inx 
.. 1—Inx | 

(x77 ) 3 

(1) cote (i) —tanx (ii) tanx (iv) secx (v) cot 

_. 2Inx l 


(v1) 3cot 3x (vii) 2cotx (witi) Qcotx (ix) x 
x xin x 


(1) glnx (a) In(w -- 2) (iii) x ln (2χ + 3) (iv) x + Inx 
(v) —In(1 -- x) (wt) In (x? - 1) 
(vit) — 3 In (1 — x?) (vidi) 41m (x? — 2x + 3) (ix) $= In (x? + 3x) 


† In this and subsequent exercises, the arbitrary constant has been omitted from all indefinite 
integrals. 
ae 
(x) Insinx (xt) 3lnsec3x (xu) 2insin5 


2 
(αἰ) In (sin x — cos x) (x) In (1 + tan x) 

7 (i) 0.9163 (ii) 0.3365 (iii) 0.1256 (iv) -1.0986 
(v) Impossible (vi) 0.49 (vii) -0.8047 (viii) Impossible 
(ix) 0.6931 (x) 0.3465 


8 (i) 0.693 65 (ii) 0.693 10 
9 (i) $\pi \ln 2 \approx 2.18$ (ii) $\pi \ln 4 \approx 4.35$ 
10 2.21 


-“ | ee 


τι (i) 1.295, 12.71 (it) 


Exercise 9.2 


1 2.208 x 104, 2.692 x 1045, 1.105, 0.4529 x 107° 
2 (i) 2.303, 6.908, 13.82 (ii) 4.605, 9.210, 13.82 


3 (i) 3e* (ἢ —te* (ii) (ΑΙ -- ) εὐ" (iv) (α΄ + 2x) δ 
--1χ2 , x e* = e~ τῷ Ἔ ‘ 
(v) —xe ?*" (ut) ar a (vii) e ~ (sin x — cos x) 
(viti) e?* (2 cos 3x — 3 sin 3x) (ix) 3¥*In3 (x) (Inx + 1)x* 
4 (i) --2 25. (ii) Be (iti) He?” (iv) &* (Ὁ) --ετ 
— 10% 
(γι) τῇ τὸ 
l l 99 
5 (1) 2(1 -ἢ (1) Se? (εἶ —1) (tt) 1 Te (tv) ae 
6 ae @ Sa aA 
[2 2 A 
“; oe ] 
8 (ἢ π ai nn + 47 
9 (11) 10 52% (tw) 6.93 days 
10 (1) τεσσ INn2 ῃ 4.33 x 1077 (ὦ) 4.3% 
11: (2) 565m (11) 13.8 min (21) Never 


Exercise 9.4a 
2 (i) l—x +42 -- ἰχ + dit (ii) 1 + Qe + 2x? + HP? + Ξχΐ 
(iit) 1 + 4x? + sox? (ὦ) x + Gx? 
3 (i) 1.648 72 (ii) 0.367 88 (iii) 1.105 17 


Exercise 9.4b 
2 (1) ox — gx? + pox? — dx*® (ἢ) 2χ — 2x? + Bx? — 4x7 
(11) x? + 3x4 
3 (i) 1.099 (ii) 1.511 (iii) 2.610 
Exercise 9.4c 
r (1) L+dx— ἀχ Ὁ 7x? jx] < 1 
(1) 1 --τχ Ἡ χ' -- χ' jx] < 1 


(v) 3 — 4x t+ fx? — qx? [x] < 2 
(vt) 2 -- ἰχ — tex? — A? [a] < 2 

2 (i) 1.004 99 (ii) 0.997 49 (iii) 2.024 84 (iv) 3.009 97 
(v) 0.247 512 (vi) 0.126 89 

3 4 

Ϊ 2 9 2 4 
4 atte "1327 oe 
Exercise 9.4d 


: (i) x —4x3 (ii) 1 — 4x? + ἀκ (iti) χ +409 (ὦ) 1 + 40? 44:3 x4 
(υ) x tx? +4x° (vi) ox? + ὡς i 

2 0.100 33, 0.017 45, 0.000 29 

3 1.035 53, 1.000 58 


Exercise 9.5a 


1 (i) x®&—e& (ii) sinx — xcosx (ii) Z(2x — 1) e™ 
x 


- 
iv) 4x2 Inx — 4x2 (0) 2χ sin 5 + 4cos= (vi) --- (Inx +1) 
2 (1) —(x? τ ῶῦχ + 2)e7* (it) Qxsinx + (2 — x*) cosx 


u10" 10" 


i) θ tan θ — logsec@ (ww) 10 


( 
( 
( (In 10)? 
4 (i) 4e* (sinx — cosx) (1) 3 δ᾽ (sinx + cos x) 
(iit) 4e7* (sin x — cosx) (iv) 5 δ᾽ (sin 2x — 2 cos 2x) 
(i) x{(Inx)? —2Inx +2} (ὦ) xtan>' x — Σ1π (1 + x7) 
(22) usin” > u + gu —u*) (iv) $(x? + 1) tan"' x — $x 
( 


Mio Gide Gio eG) -- 
2 


Exercise 9.5b 
δι... 16 397 


1 i) 5 Gi) τ (it) <= (iv) 5) See 
6 on. ΝΕ 
2 Oz (i) “τ΄ (iii) 0 (iv) = (v) 0 


4 x (Inx)? + 3x (Inx)? — 6xInx + 6x 


n 


4 I = ] tan”! θ ame I,-2 


+tan? 6 — tan@ + 6 
+ tan? 6 — }tan® 6 + tan@ — 0 
Exercise 9.6a 


ἰῷ ae 3 Z 3 . ward 2 
oa x—2 ao x+2 (221) ox + 1 
2 4 $ l 2 
5 2 _ 9 ee) 
Cae πάπαν ees (v) rea 
Σ. 2 ] ] 
}.} --Ξ ἘΞ es 
(υ1) ἘΝ ἃ ναὸ (va) 1 as 
1 1 
Ὁ λον eg hea 
i l l Ὁ» ἥ 2 
2 Oo ay @o ΠΣ (#) - at ey 
a 2 q 1 5x — 3 
a 2 4 . 
(11) τ ef] (w) 1 χα ταὶ + ie nip 
ae x aU 1 1 
1) - -- ες; ΝΠ ΕΣ, ἘΦ Ψ 
Oe τἀπὶ on. eT 
4 1 ome 
fete, ΣΕ. τ. 
Rat ee ee xt] x"-x*x*4+1 


4 1 — 3x + 7x? — 15x? 


5 (2) r(r + 1) em ὦ l 
Σ ] n+ ] ] 
(a) --- , ———, — 
γ 2). 22) 2 
Ἂ ] l ] ] 
(ia) ———_—_____,  — — —_________-, - 
Hep Ὁ ἘΠῚ A 2a laa 2). ἃ 


Exercise 9.6b 

, ὦ) $n (ii) In - (iii) ἔτη“ 

ee 5 eo τ + nt ue) -— ti, 
(10) x -- ——— + 2In (x —}) 


3 (i) In (x* - 9): Stan! τὶ bin (x? + 9) + tan7"> 


(11) In (4x* + 9); bran! 55, ἔπη (4x? + 9) -- tan“! ΤΣ 
(iii) ln (x^2 - 6x + 10); tan^-1 (x - 3); 
3 In (x? — 6x + 10) + 3 tan7?! (x — 3) 


. i ae 
4 (1) x + ἰὴ (x — 1) oe Mae rare (11) 3x7 + In (x — 1) 
(qv) pre ΣΝ. (x — 1) (v) ξ]π (x? -- 1) 
Cate ] τι eke ΞΕ] 


1 (ἢ) x° (ii) Ψ (iit) -= (iv) s/x (ὦ 25} (wi) A(x + 1) 


(v22 ) at ἢ (υ111) 5.) (3u 2s ἀπ ΞΕ" 
(x) ao (Ἱ) ἐ  - 1) Aan) —/(1 — x?) 


2 (1) 2lnx (1) 3In (x — 3) (1) —-m(1—u) (ὦ) 11 i — 1) 
(v) --ξ ἰπ (1 — x°) (υἱ) In (x? + χ) (vit) 2 ο25. (viti) 1 ε55 
Oh 


(ix) —de7* (x) In (1 +e") (xi) (xii) In (In x) 


n 10 
3 (1) ssin2x () —2 cos 5 (a) 3sin°@ (iw) —4 cos? θ 
(v) τίς βίη 30+ 2sinO (uw) --ξ οο50 Ὁ τ οο5 30 (vii) Fsin? x 
viii) a(x — sinx) (tx) tsinQx + 4x (x) 2,/(sin x) (xi) In sin 6 
xi) —In cos θ 


( 
( 

4 (1) tanx (11) ἴδῃ χα —x (111) secO (10) —cosec@ (v) —coté 
(vi) —cot@—@ (vit) stan? @ (vit) $sec? @ (ix) —41n cos 2x 
(x) gtan* x +Incosx (xt) 2Insindx (xii) —4 cot? x — Insinx 


5. (i) sin (ii) $sin ΞΕ, (ΟἹ) Fan! 2x (ὦ) fran > 
(v) In[u + /(1 + u?)] (vi) ) In[u + J(u? 
(viz) g[sin™* x + x/(1 — x?)] (υἠϊ[Ἐ} "ἫΝ ii 


(2x) tan! ἘΣ (x) ἐ In (4x? + 9) 


(1) xsinx + cosx (1) —xe* -- 4 (at) —3tcos 3t + $ sin 3¢ 
(iv) 2x sin x — (x? — 2) cosx (v) xsin”' x + ./(1 — x?) 
(i) xInx —x (vii) Otan@ — Incos@ (vit) (5x2 — dx + 4) c?* 
(ix) —ze '(sint + cost) (x) τἰς 625 (2 cos 3¢ + 3 sin 3t) 

10*(x In 10 — 1) 
( (In 10)? 


I 
7 (it) 1 (i) In2 (ii) τὼ (3 + 2,/2) 


xi) 9x° (BInx — 1) (xii) 


8 (i) ἐπ (i) Inin2 (it) yl (8 + 22) 
3 ᾿ αν code αὶ ὡς x 
9 (1) Ϊ rare | (12 ) In (222 ) In Ἔ ᾿ (1}) ἜΤΟΣ ΕΣ 
(v) ἴῃ (1 4 (wi) χ -- ἰπ (][ + &) (ἢ) Δ tan~* x? 
(viii) x (In x)* — 2x In x + 2x 
(ix) 4x3 tan7* x — ἐχ' +41n (1 + x?) 
] ee 
(x) sgn (=; tan 4 (xi) sec! x (xii) sin7’ x + ./(1 — x”) 
ro (i) +In2+4n (ii) Σ (wt) ἐ(π +2) (w) 1. (v) = 
4 8 4 8 2 ( 105 
π Ἢ 
(υἱ) 56 (vit) ξ(2 65. + 1) (υῖ)}} ἐπ + 4/3 (1x) a 5 In 2 
(x2 ) τ (xu) ἐπ --ἸΔ 
Exercise 9.8a 
d d d d 
2 (ἡ ty=0 (ii) t= y (iti) 2x = y (δ) y= = 
d*y dy oo αἶχ 
Ὁ)" τ — We + Yy = 0 it) τ 
Exercise 9.8b 
; δ: stake a I 
1 (1) y=A& (ii) y= Ae ™ (111) ἡ τε xr +A (10) y= 
(v) y= Ax (vi) Wx? γῆ =A ae 


4 
dy y dy  * 4, 9. 
(1) F ice er πα 7. +y° =A 
dy dy 
i) dx ᾿ dx ΙΝ. rae 
dy  2y dy ἌΝ 2 
ΕΣ, a ee ae ολόξοις, = A 
(222 ) Ree a cae Dy χὰ 
dy 1 ἀν 
—_ = --- SS SS “-- = A 2x 
(tv) dx 2y dx 2Y, ¥ έ 
dy dy 2 
—>= a= - —>—= = — = A 
(v) a = ha y 2x + 
vt OP: εἰ ἐδ, i ὅΤὶ x? — Qy? =A 
(v1) y 


dx x dx 2y 
(1) sinx =2siny=>y = sin’ (7 sin x) 
(1) y = 2tan™* 5x 


S = 4 e™, 57 000 6 100 years 




II 


12 


13 
160 000 km 
Ι 

171 years, 141 years 


26% 


x= 


dd 
τ, Ξ 4 -- ὅθ; 3.76 min (a = 21.3, ὁ = 0.170) 
dM 

ae + bM = f(t), 21.2 days 
du 1. gt kv 
—~ Sg ee Fy -- -- 

τ πα δ Tara 
Ι 2 

(1) 54m ( + (10) (ἢ 
Ι0 -- a 

ἐπ Ti ) 3 Κρ 14 132 min 


Exercise 9.8c 


I 


m CF N 


I 


(2) y=Ae*+xux—1 (t)y=x ae was 

(11) y = δ᾽ + Ax? (Ww) y= Ae* 7 — 2x +2 

eee θϑϑ fit decde oa 

Ae' Ὁ ὁ (sint — cost); 4(sint — cos ἢ) 

9.2" νι ) 

ΠΣ ἀν. BG 2a pet St eae 
R*? + w?L? 


LE 
RoE (R sin wt + wL cos wt) 
WRCE, 


ΤῸ wR2C2 (ORC sin Οὐ — cos cot) 
@ 


(22 ) gine: Ae tlRE 4 


ORCE, 


Lt wo R2C2 (ORC sin wt — cos wt) 
ω 


Exercise 9.8d 
(4) y= Aer + Be (i) y= Ae*+Be™* 
11) y=Ae*+ Be ** (ὦ), y=A+t+Be™ 


( 

( 

(1) Pa ee Vee re 

(2) y= Ae + Be?® + $(Qx? + 6x + 7) 
(2) x = Ae'+ Be * 
( 
( 
( 
( 
( 
G)yse* +2 x =et—e* (i) O=ke te * —2 
(7. — e 3); Brad (& 17°) when¢ = In 1.5 (= 0.48) 
= Ae 20% 4 Be 10008 4 4 


Exercise 9.9a 


b 
1 —cothé 
a 


3 (1) 2cosh2x (11) $sinh * (ii) x sinh x + coshx (19) cothx 
aa” 


(v) tanhx (vw) —sechxtanhx (viz) —cosech x coth x 
(υ111} cosech x | 
4 (1) coshx (1) Ssinh2x (i) tanhx (10) Incoshx 
(v) xsinhx — coshx (wv) x tanh x — In cosh x 
5 (2) 2sinh1 = 2.35 (11) ἐπ (sinh2 + 2) = 8.837 
3 5 2 4 
6 (ἢ xtGtat Qi as 


Exercise 9.9b 


1 (1) M2 (ἢ) In (2+ /3) (aii) Zins 


| δὲ 


2 (1) sinh”! (a2 couhe = Ga tee i; ese 
3 : 3 2 


~~ NO 


(v) cosh™* (xn + 1) (ut) sinh™’ (x + 1) 

(vii) xsinh7' x — /(1 + x?) (viti) x cosh7! x — (x? -- 1) 

(ix) χα +1) + sinh! x} (x) dxJ (x? — 4) -- 2 cosh”! 5 
3 (i) sinh7™'2 = In (2 + /5) = 1.44 

(11) cosh” ' 2 — cosh™' 1 = In Έ Ῥ 2) = 0.96 

(ii) ./3 — xcosh™' 2 = \/3 — $ln (2 + ,/3) = 1.07 

(wv) 3/5 + Zsinh7' 2 = 3,/5 + Ζ1π (2 + J/5) = 1.48 


Ι 
4 (i) —tanh7!= 
a a 


1/2 
(ii) ae i ΩΣ . # tanh ,/(gk)t; Je) 


Exercise 9.10a 
r (i) $(10/10 — 1) (ὦ) In(1 + ./2) (ai) 2(./2 - 1)α 
(iv) a(/2 + In (1 + /2)) 


2 6a 
Exercise 9.10b 
1: (i) {ποῖ (εξ -- 72 +4) (a) 8παξ(ῶ. 2 — 1) (iit) 4200? 
(wv) “na” 


Exercise 9.11 


1 (1) na,ina*,@=434n+0 (i) 8a,3na*, φ = 4(n + 8) 
2 α΄. = 20 

3 a 

4 


ga’, ξα(. 2 + In (2) 


Miscellaneous problems 9 
I ᾿ 2.2.2 0. 3.57.2. 40 2:09, 2:69 


8 (0) ) Jn 


Exercise 10.1 
1 (1) -3—6 (uw) -2—8 (a) 10+ δὲ (w) 11] Ὁ 21 
(υ) —13 — 19: (uw) —3 — 42, —7 + 241, -- 527 — 3362 
(via) —20 — 202 (viz) —11 — 2: (ax) 61 (x) 50 
(xt) (αὐ — 10x? + 5x) + 1(—5x* + 10x? -- 1) 
(αἱ) (x® — 15x4ty + 15x?y* — y®) + i(6x°y — 20x°y? + 6xy”) 


tig) THE ἕω, 
50᾽:᾽ 50 
2 (i) =" (i) 028 τὴ (iii) 0.02(1 — 71) (ὦ) 1 + 2i 


(v) —i (vi) 0.008(—11 + 21) (viz) 1+ 21 (υἱὲ) cos@ + isin é 
3 p= 7k,q = 3k (κε Ζ) 7 
4 (i) (x® — 3xy?) + i(3x2y — y9) (ii) (x? — y? — Qixy) (x? + y?)~? 
7 y 
(211) x + x γῇ + 1 τ ety 
(4 + 51)(2 — 31); 83? - 1? = 717 + 437 
2 


+(3 -- 21) 


wooo ol 
NO 
| 

pale 
»»-ἰ 
+ 
<2 
© 
Nh 
I 
No|-= 
| 
—" 
+ 
{5 


Cp 
(io) 34.7 -— i) (@) -2, -44 J5i 
(vi) 0.1(1 + ./192) (2 — 1) 
12 (1) Perna (11) χ + 62+ 10=0 
(1) z2—4z+7=0 (ὦ) 27 -(1+i)z+ (2-—i) =0 
(1) 5,7,-+11 
(1) —1,7, -13; 2? +25 «0 5» Ζ,, z, complex, but not 
conversely 


13 
m (i) ( -- δι +7) (ὧδ) 0.201 — 2ὴΠ -- (—-21)"] 
(1) HE] — (n + 1) + δ} 
155 A= i B= —ti 


Exercise 10.2a 
32,¢+2,=34+2i; Ζ, -- 2) ΞΞῚ -- 41; 242, = 5 Ὁ 5; 


ξΞι..0](-ἰ-- 7i); ΞΖ -  ρϑ(--ἰ -- Τῇ; 2=3- 4 
29 Ζι 
4 iz= --12 -- δὲ; 2-- 1 ΞΞ -ῬἩο τ 12]; 3z = —15 + 36; 
—5 -- 121 
— ΞΞ 158 -- ," “1 = —___—__; 2-- —119 — 
3z 5 — 361; z 169 Ζ 1202, 


zz’ = 169 (on real axis) 
6 Line segments AB and CD are equal and perpendicular 
8 —2+ 1,6 — 31 9 —4+ 41, -21,-5 —1 
10 6+ 21,7 —1;or0,1 — 31 
1 24+15; --ἰ -- 4 
12 (3 -- 2i) + /3(1 1) 


Exercise 10.2b 
1 —2—2i 5 c= —aw* — bw; c= —aw — bow’ 


Exercise 10.2c 
1) leisx (a) “2 cis (-7) (11) 2cis60° (0) 1 cis (—90°) 


( 
(v) 13 cis (π + τΔη 4s) (wi) «(13 cis (π — tan! 1.5) 
(vii) lcis (ἐπ — αὐ (wii) 1 cis (a — ἐπὶ 

2 (i) 2/3 +21 (i) —5 (ii) 5/2(1 — ἡ 


I 


(10) —8cos 15° + 8:sin 15° (v) 2 cos 80° + 22 sin 80° 
(vt) 3./2(1 — ἡ 
3 (i) ab = 5,/2 cis (π — tan™' 2) ) a/b = $,/2 cis (—4n) 
(122 ) vue = /2Qcisin (ὦ) a? = he cis (tan™ * 3) 
(v) b? = 10 εἰβ (π — tan™'2) (uw) a +h? = 54/5 cis (π — tan™' 2) 
4 a" =cis3an; ὁ" = 25" οἷβ (-- ζππ)ὴ; ab = 2 cis = 
@_ia,ils 
δ * 12 
᾿ . ὅπ . . ο 1 1 : 
5 cis (06+ 9); cis = --ἶ; οἷς 240° = —} — 3./3: 
6 (i) cis(+4m) (ii) cis(+3x) (ii) 3/5 cis (x + tan’ $) 
7 (i) z^2 + z + 1 = 0 (ii) z^2 - 2(cos 36°)z + 1 = 0 

(iii) z^2 - 2(cos α)z + 1 = 0 

11 Apollonius’ median theorem 

12 (1) circle centre 7, radius 2 (2) perp. bisector of —1, 21 
(111) interior of circle centre — 32, radius 2 (10) portion of plane 
inside circle centre O, radius 4 and outside circle centre 2, radius 1 
(v) ellipse foci +3, major axis length 10 

13 (1) region below real axis (11) region outside circle centre O, 
radius 3 and inside circle centre 3, radius 3 

14 First circle entirely inside region between concentric circles 


Exercise 10.2d 
r (i) 20cis230° (ti) εἰ5 (--ῬθΞ —) (tm) 1 (i) cis(—lgn) = -2 
(v) 2/2 cis Σ (v2 ) cis τ = --ἰ (vii) 2(1 + cosa) = 4 cos 50 
(vit) 4sin? 4B (— cosB + isin B) (2x) cis (-- 30) (x) 2 cis @ 
(xt) cis (α + β — 2y) (xi) tan ἐθ (sin θ᾽ + i cos 8) 
3 (1) 2costOcis$0 (uu) itanz0 (ii) 2 ες 0 


θ 
Gv) -΄ -- i tan (46 + 42) 
1 — sin 9 
5 Rewrite f = —— 
b—a 
pe ΩΣ τῆ AC 46 
ἼΣΩΣ = Bp δα Ss τως ee ABCC ABO 
ia fa oe ee 


similar in same sense 
7 Use AABC similar to ABCA 
8 See no. 5:/ = (1 — k)b + ke, etc. Add three equations 


Exercise 10.3 
4 zz, = rr. cis (--θ, + 02) 
4 z= 2* cis20>argz=OGor64+n 
5 (2) 8zz*#+z+2z*%* —1=0 
~~ Bal) ae) 4π 
Oat ὐΞυ θαυ 
(Ζ -- 2") (1 -- /37) 
8 143,15 9ο 2-ἐ ἐ 


ὅπ 
10 (Ζ + 2)(z* + 42 + 8); 2 cis m8 cis (+ 


4 
1ι cis (+60°), 1 (repeated) 
12 2+2,-2+1 12 lege aes 
τις OA 1 BC =>6 —c = dia (λε ΚΕ): conjugate and eliminate 4 
7 {1 11 
ab ε|-Ξ0 
a* δὲ οὟ 


Exercise 10.4a 

1 (i) cis(—72°) (ii) cis 220° (ii) 108 cis (4 tan” * 4) 
(iv) cis 80° (υ) 27/* cis (-- 521) (ut) 32 cis 2ὲπ = 321 
(vit) 27° cis 120° (witi) 2.5 cis 15 (ax) J2 cis (—30°) 
(x) cis (50 — 4m) (xt) sec"Ocisn(gn — 8) (xu) cisa 


2 +2°>4(1+1)=+cisin; VYi= cis (F + Hn) 


4 —110°, —110°, —110°; cis (—94°), cis (—22°) 
91/12 cis (74° + 60k°), k = 0, 1, 2, 3, 4, 5. 
6 (1) l,cis(+3n) (ἢ) —1, cis (+92) 
(111) cis 50°, cis 170°, cis (—70°) 
7 32c® — 48c* + 18c2 — 1; 645 — 80c* + 24c? — 1; 


na 


5, -- 10° + ἢ : ᾿ 
1-- 102 4 5{Π᾿ 320" -- 320 + 6¢ 
8 χτ χ᾽ = -—l>z=cis(+120°); χ᾽ +2" = 2 cis (n120°) 


10 2" cos" 20 (cos 4n0 + isin4n0); sum = 25} cos" 46 cos 4n6 
11 tan 15° = 2 — \/3 
τα tan (12° 41’ + 436°) (k = 0, 1, 2, 3, 4) 


] 
Wp 


12 sknorkn + tan ' 


k 
15 cot* — k = 0,1,2 


Exercise 10.4b 
: 1+ 3,,-3+121,~-1- 31,3 —-—1 
2 (1) 2cis0(6 = 20°, 140°, 260°) 
(ἃ) ./2 cis θ(θ = —16°, 56°, 128°, ae 272°) 
(a2) cis (-- 205 + k45°) k = 0, 1, 2,. 
(w) cis (10° + 40°) k = 0,1, 2,.. 8 
(v) 25 cis (—45° + 490°) k = ε l, 2 " 
(vi) 25 cis (25° + 1605), k = 0, 1, 2, 3, 4, 
(vii) cis (223° + 90°k) : =0,1,2,3 
( 


viii) 2'/8 cis (8k + 3) τε’ k= 0, 1, 2,3 
k 
4 (i) 1+ /3i, -2 (i) z= cis Es k = 1,2,3,4,5,6 


ee 3 
(111) cis (+2). cis (+= (10) cis (=). k = 0, 1, 2, 3, 4 
( 


v) /5 cis 0,0 =4(n + tan™' 2 4+ 2k), k = 0, 1,2 
. 4—5cosa — 3151ὴ ἃ 27k 

8.1 5 — 4 cos a ace 
(11) z = 51, 0.1(3 — 1), -- ἢ, 0.1(-3 — 1) 
(11) 2115 cis (415° + k120°),k = 0,1, 2 


Ink | 
(iv) cis = (k = 0, 1, 2, 3, 4); cis (4a + 40k), k = 0,1, 2, 3 


7 2 —5z+7=0; 2+2+7=0 
g Condition for equilateral triangle (cf. Ex. 10.2b, no. 5, Ex. 10.2d, no. 7) 


cos 58 + 5 cos 30 + 10 cos 0 
10 () ———_ 


16 
10 -- 15 cos 20 + 6 cos 4θ -- cos 6θ 


(22 ) 39 
35 sin 6 — 21 sin 30 + 7 sin 56 — sin 70 
111) 64 

(wv) sin* 6 cos® 0 = 27° (cos 100 + 2 cos 80 — 3 cos 60 

— 8 cos 46 + 2 cos 20 + 6) 

600 — 45 sin 20 + 9 sin 40 — sin 60 a6 

i -——— ; 3 x 2 
192 
12 38sec afl — (—1)"cos2na]; 4sec a(—1)"*! sin 2nx 
: 1)4 1 
13 Ne ae a tan ἐπα + cotza — (nm + 1) cot (n + 1)4a] 
2 sin γᾶ 


cos θ cis 6 
ta HOEY - i; — cos θ cis ἢ] 


Exercise 10.5 

r (1) a (1) —1 (a) e 1 (wv) Qsin 1 
(v) sh2cos5 —ich2sin5 (uw) cis3ch1 — isin3 sh 1 
(vii) In8+i(2k + 1ὴπ (( ε Ζ) (vit) In8 + i(2k + 4)n (ke Z) 
(ix) 1π. 2 - 71(2k + ζ)ὴπ (x) In2 + 71(2k — ξ)π 
ox —2 cos 3x + 3 sin 3x τὸς —3 cos 3x — 2 sin 3x 

4 @ ———SS -- -- ποῖ: νον See Ὁ τ τον 

13 13 

6 (1) sinxchy +icosxshy; 1° = 3(ch 2y — cos 2x) 
(11) cosxchy —isinxshy; 1? = $(ch 2y + cos 2x) 
(ii sin 2x + ish 2. ΓΞ ch 2y — cos 2x 
cos 2x + ch 2y’ ch 2y + cos 2x 


Φῷ © ms! 


10 


12 


13 


—4n + iln (3 + 2,/2)(+2zk) 
sinh? y = $,/3; cos? x = $,/3 
w=e’; coshz 
2υ ᾿ ε΄. --Ἰ 

1 — uy? --υ ae 61 
(1 — u? — v7)? + 40? 

EG ee) ae ec 1 
(i) e** (A cos 3x + Bsin 3x) (ii) δ᾽ (Acosx + Bsin x) 
(111) Acos2t+ Bsin2¢+13 (w) Acos 3ὲ + Bsin 3t — 4 sin 41 
(i) 365: sint (a) §(1 — cos 3t) + 3 sin 3¢ 


tan 2y = 


where οἷ" = 


Exercise 10.6 


I 


(1) translation 
(1) half-turn about 1 — 2: 
(111) enlargement x5 in point --ῦ 
(iv) enlargement x2 and rotation +32 about O 
(v) spiral similarity about O, x5 through (—tan™* 13) 
1) spiral sumlaTy about O, x 5 through 100° 
1) enlargement x > in point 2 
(viz) spiral similarity x v 2 through απ about O 
(ix) rotation through +32 about -- — i 
(1) 2 = z+3t (a) z’ = 3zcis 135° 
(i) z’ —i = 3,/2(-1 + ἡ) (Ζ --ξ ἡ (ὦ) χ' = —iz +2 
χ' --Ἠ 23 Ὁ Σὶ -- Ζ (wi) χ' =4z—-6—31 (om) Ζ' = 32 -- 21 
(vit) χ' = 32 -- Gi (x) Ζ' = --212 - 1+ 21 
) z= --212 - 4 - 31 
(1] Rotation +42 about point 1 (7) reflection in line y = x — 1 
(1) Reflection in real axis (11) reflection iny = x 
(111) reflection iny = xtana (10) glide reflection in liney = x + 1 
(v) glide reflection in y = x — 1 


Exercise 10.7 


I 


(1) Circle centre O, radius 2 (i) imaginary axis (11) y + x = 0 
(10) real axis 
(v) portion of line y = 1 — ./3x in first and fourth quadrants 
(vz) ellipse, foci Oand 1 -- δ (vz) circle, diameter 1, —1 
(viii) circle, diameter ?, 15 (ix) line segment joining 1 and --ἰ 
2 2 
( 


: x y 
x) ellipse 4 Ἔ a= Ι 
(1) Interior of circle (a) exterior of circle 
(11) circumference and exterior of circle, centre — 41, radius 3 
(10) exterior of circle, centre 3 — 2, radius 2 
( 


v) interior of ellipse in no. 1 (x) (vz) all points for which y > 0 
The locus of points whose distance from —1 and from — 16 are 4:] 

has centre O, radius 4 

[2] = 1 

3zz* + 412 -- 4iz* + 4 = 0; 

zz*(1 — A?) + z(—a*® + A?b*) + z*(—a + 120) + aa*® — 47b0* = 0 
half-circle [Ζ ΞΞ Ὁ 
semicircle \z| = 2 from -- 21, —2, 21 

) semicircle Iz| = | through —I,z, 1 

the same moved +2 in direction of real axis 

semicircle |z| = 2 through -- 21, 2, 2: 

whole circle |z| = 1, beginning and ending at — 1 

vii) whole circle |z + l| = ] 

viti) quadrant of circle |z| = 1 


CR mn aa aa ami 
So. 8 τὸ. OMe 
ww NaS Ms LL 


emicircle [χ = 1 through 1, 7, —1 

1) the same translated +1 in direction of imaginary axis 

1) the same enlarged x 2 in origin 

the same rotated +37 about origin 

the circle [χ = 1 anti-clockwise from 1 to 1, thence along 
real axis to O 

(v) the semicircle |z| = 1 clockwise through 1, —i, —1, thence 
along real axis to — 00, and from + © to +1 


TN NSS 
~ 
τω 
~ 
ee 


τος 
Ξ 


(1 From 0 to 1, thence along parabola y* = 4 — 4x to 2, thence 
along γ΄ = 4 + 4x to —1, thence to 0 
(11) from 1 to e, thence round circle \z| = 6 toecis |, thence along 
radius to cis 1, and finally along circle ΙΖ] = 10] 
2 


2 
(11) from 1 to cosh 1, thence along ellipse ay + τ = Ι to 
42 γ 
ch (1 - 4), thence along hyperbola pS τ ΞΕ 00 
cos’ 1 51η4] 
cos 1, finally along real axis to 1 
Exercise 10.8 
1 Imaginary axis with y > 0 2 Anti-clockwise along ω] = | 


II 


I2 


Iz +4—-— 31] =5; 3x -- ἅν = —24 

Coaxal system of Apollonius circles based on the points | and — 21; 
the origin side of the line 2x + 4y + 3 = 0 

x = constant +> circles centre O, radius οἷ 

y = constant +> half-lines, arg w = y 

2—1,-1+ 2: 

Three quarters of the circle x* + γ΄ — x + y = 0, starting at O, 
passing through | — 2, and finishing at | 

Confocal hyperbolae and ellipses 
Miscellaneous problems 10 
a=+2,b= +3; (22 — 42 + 13)(z? + 4z + 13) 
4 Rotation through +6 about c; _half-turn about «81 


2 3:, 21 
) the points of |z| = 2 in the third and fourth quadrants 


111) true, cf. Ex. 10.2a, no. 7 

i) r = 1; f(z) describes line segment [ —1, 1] 

11) θ᾽ = O, real axis, x > | 
0 = nz, real axis, x < —1 
θ = +32, imaginary axis 

7 IfU, P, Q represent 1, z, z*, and if 2 UOP = 0, then UP and PQ are 
inclined to real axis at angles 20, 36 (As OUP, OPQ are isosceles) 


] 
8 cos” τ cot? = roots of 5z* — 1027 +4+1=0; oT 


(2 
(12 
5 (ἡ) True (a) falsejeg.a =0,b = 1,¢=2 
(2 
( 
( 


constant +> hyperbolae 
= constant t > ellipses 


x 

y 

“ 2a sin ᾿ 
13 


12 
sin 2a —cos 2a 


Exercise 11.1a 
1 (i) ¢a+ δὴ) (ἢ) 3α -- 18 (wm) fa +36 (w) Za — 56 
(v) —za + $b 


Exercise 11.1b 
2 
2 (1) 4h+2e (ut) fa+fe (i) ga + $b 
( 


iv) (v) (vi) ga + 3b + 6 
4 (1) -2b+ 3c (tt) $e -- ἰα (itt) —a+ 2b 


γ Ξε (1 — tha + ἐδ + Ste, 
j(a+b+e) 
5 λεέμ-ξ 


Exercise 11.2a 

1 (1) 3(b+e+d) (ΠῚ) 1ίᾳα τ δ᾽.- ὁ - αἱ) 
(11) <(a+b+e+d) 

4 (i) ἰα -Ξ ἰδ - ἐς (i) 

4 12cm 5. 7: 


1.3 7) (ii) —0.4i 2.1) -- Ο.3Κ 


( 
26 
Ξ 
Exercise 11.2b 
ι (i) (13, 2%) (ὦ) (Iga, 126) (az) (10) (8 
2 (i) (19,0) (#) (182, 0) (a) (6,0) (ὦ) (4h, 0) 
3 x = Za 5 3yla”, 37ζα 6 χ- 2.17] 


Exercise 11.3b 

(1) 3,18° 27’ (11) 1, 70° 32’ (az) —4, 161° 31’ 
(1) 2ὲ + 2k, 37 + 2k, 66° 55’ 

(1) 27 + 2k, --δὲ + 37, 64° 54’ 
( 
( 


I 
2 


111) --δὲ + 37 — Καὶ, --δὲ — 37 + k, 115° 23’ 
ἢ) 37 -- k,j — 2k, 45° 
3 Lines meet at (2, 1, 1), 70° 30’ 


4 (3,3) 
— 2u, — 2t 
5 ae t u ae 
| ae | ἢ = τῷὸ 53> Te 
5 


Exercise 11.4a 


1 (i) r= +j+2(1—-A)k; x= — Ξ 


(i) r=A+(L+AZ+ (2+ Ak; x= y-1LEz-2 
2 (i) w@tyt3z=13 (ἃ) x + 2. τ 52 ΞΞ ὃ 


4 (i) 61°53’ (a) 38° 13’ 


Exercise 11.4b 


: (1) ἐ- 27 - 3k,i-—j—k,128°O (ἢ) ἃ -- 7.71 + k, 120° 
2 (2) 55. (it) 38° 34’ 

3 (1) 3 (1) 7x + 56y — 5 = Oand 16x — 2y + 35 = 0 
4 (1) 6 (1) 3x + ὅν +4z+6=O0andx—y—-8=0 
5 (2) - 17’ (ἢ) 54°43’ (ti) & 

6 ..5 7 3 

8 2x + 24y¥ —z=9, (4, 1,1) 


Exercise 11.5 
1 ti-j,tt+y, τῆ -- ξ 71: 2k 
2 (i)i-—jitj,i-—jg+t2k (ἃ) 90° (i) 54° 447, 90° 
(iv) 39° 14’, 39° 14’ 
Exercise 11.6 
I (1) γΞ 99 (a) 2 6“. 2 92. 0 4,35: 


4 —20¢7,10; —2027, 2k, 10 


Exercise 11.7a 
r (i) 3x 10*Ns (@)10°Ns (1) O.75Ns (wv) 2.7 X 10°27 Ns 
300 Ns (1) 12Ns (ἢ 80Ns (w) 2Ns 


(2) 

(i) 2000Ns (i) 2000Ns (at) 2 πι5. 

(i) 1000Ns (ὦ) 1000Ns (cz) 1 πι5. 

(i) 667Ns (uz) 667Ns (a) 0.667 τὴ 5. 

(1) —40000Ns (ii) 260000Ns (ii) 26ms ' 
(i) —20000Ns (ii) 280000Ns (i) 28ms7' 
(1) —20000Ns (i) 280000Ns (a2) 28 πὶ 5. ἢ 
4Ns, 200 N 10 400N 

(1) 0.00162 = 0.005Ns_ (ἢ) 0.005 N 

50 N 12 1500N 

50 N, increase 

7.2 x 10°Ns,7.2 x 10°Ns,7.2ms ' 


oon Oo kf | Ν 


με "π᾿ μΜὶ μ᾿ 
wo Ὁ Νῷ »μ»ῃ 


Exercise 11.7b 


τ (i) 1+ 47 42K (i) ~38 -- 97 + GR (iii) 2 + BK 
(iv) i+ 1.23 

2 (i) e+ 5+ 3k (ii) 3ὲ -- 3) 45k (ἐδ) δὲ + 3) + 4h 
(iv) 4.54 + 3.6) + 2k 

3 (@) -%+10k (i) —247— 1.6 + 1.6K 

ΠΑ ΡΟΝ τ 


96 νυ ᾿ 436 


Exercise 11.7c 
1 (i) @+37+k (ἢ) 2+ --ἰΚ 
-(1.8 2.4 
ΞΕ 
0.8 8.4 
4 (600, 400, 20) 4 1600N 


1 


N 


2 = 9 =o 
go kms ,oa5 kms ,. 9 8 


5 
6 (i) 305, 8v (i) 25.2 πιὺ 


Exercise 11.7d 


1 12,33ms_! 2 3.6ms ',0.4 
3 —3,+2ms ' 4 lms ',¢% 


5 10,7.5ms ! 8 0.952, 40.7 m 
Exercise 11.8a 


(1) 3001 (i) 5.48 πὶ ς᾽ ', 4.27 πὰ 5. ' 


8 000 N, 32 000 N 4 32x 10*N 

100 J, 0.064 

(1) 4x 1051 (Δ) 5000N (i) 10000N (av) 20000N 
mv? of 

OF 47 24ms°; 


2.1,2.66ms '; 0.451 
1.8, 2.8 πὶ 5. 1:0 
(1) 18.2N (az) 6.6N 


Exercise 11.8b 


Φ On AO "ἢ. | N »ῃ 


(i) 8N (i) IN (ii) 0.2m (iv) 0.05m 
(2) 

(i) 2x 10°Nm7! (i) 0.0043 
(ἡ) 5x 10*Nm™! (uw) 2.51 
(i) 4m (i) 4001 (ii) 800J 
(1) 

4.06 x 10*J, 15m, 39.8 πὶ 5. ', 53.9 πὶ 5. 2 
(1) gR (ii) 29 Κ 

J2gR (απ 11.3kms~'), /gR (9 8 Καὶ 5. ᾽) 


Exercise 11.8c 


~ 


(i) 81 (ὦ) —4J (i) 0 

(1) 36x 10°S (i) 45.9 m 

(1) 9.9 πὶ 95. (1) 10.8 πὶ 5. 

(1) 10 πὶ. (a) 5001 (i) 5.1m 

(i) 3.13ms7!,0.39N (i) 4.48 τι 5. ', 0.98 Ν 
(iii) 3.83 τῇ s7!, 0.73 N 

() 7861 (i) 6.26 ms” 1 (it?) 1180 N 
11] ἢ 5 

(1) υ = 2ag(1 -- οο5 0) (tt) mg(3 cos θ -- 2) 


Exercise 11.8d 


~—I Qu »ὮὮὉ N »ν»" 


2.26 kW 

9.5 kW | 3 0.039ms ” 
(1) 1.47 τὰ 5. 5 (i) 147ms? 

4.9 x 107 W 


2 160 N, 1 700 N, 80m 
95Ν 5,25 Ν, 0.85 πὰ 5." 


0.6 J 


0.161 (ii) 2.5 x 10733 (ii) 43° (ὦ) 0.255 


(iii) cos 


τὰ +x) (i) 25χ2 (ii) 400 + 200χ -- 25χ2, 9.7 πὶ 


= 


2 
3 


- 48° 
M. 
8 ekg sot, 5 Mgv W 
v 


F 
10 mi + kv? = F(v < U), mis = — — bv? (v > U); 
υ 


F 3/FU 
ff (F< kU), [πὶ (F< kU*); k= 05; 


4.98, 4.78, 4.38, aes ms~2: 10.68 


Exercise 11.9 


1 (i) #= —4x (ii) 3.145 (ii) x =3cos2¢ (w) 6ms' 
4 (1) 3 (2) 628s (1) ξ (w) 0.13 
4 (1) 063s (ua) 0.68s 
a 
7 (ἀπ + 2) 9 
8 
8 mg(l -- cosnt) 
ὃ; τ ποῦν : 
V(gl) * Vg 
᾿ 47 ml 
Pie pee ms 
10 τε 0, 7 r 


Exercise 11.10a 


: ()itk (i) -5i+3j-—k (iti) 2 -- 7 


(wv) —lli+99 —5k (v) --ἀὶ -- 33) +k (ut) —12t — 67) — 12k 
4 (1) Force 6¢ + 37 along 3x — 6y + 4=0 (1) Couple 4k 

(111) Equilibrium (10) Force 27 + 157 along 15x — 2y — 2 = 0 
4 (30P, 30P) along x — y = 3a; force (30P, 30P), couple 10aP 


Exercise 11.10b 


700 N, 800 N; 75kg 
(2) 0.213 N (a) 0.106N 
3W (4x -- y + 6), 3W(—x + 4y + 6) 
113 kg, 8; kg 5 Wot B 6 40° 
(1) Wtan ἢ tan 15° © 0.27 (ii) LW, tan 30° © 0.58 
(111) 4W, ἐπ =~ 0.29 
5 mgl? πισί(18α -- 51) 
6a ᾿ 12a 


μή 


Cre ὦ Ν 


10 δἰ from A, 
2d — lcos θ 
/ sin @ 


12 (i) 2,/13P,6x — ὃν = 5a (ii) Pa (iii) 6Pa 


Exercise 11.11a 


2 31+ 7, towards O 


Exercise 11.11b 


I 


3 
5 


$M (a? + ὁ) 2 5Ma’, 3 Ma’, 2 Ma’? 
3 Ma’, 2Ma? 4 2Ma?*, 2Ma? 


2a7,2 547,2 
2Ma?, 2Ma 


3M .. M 
(1) 335 Ma? (22 ) 0 (a? + 4h?) (222 ) 0 (3a? + 2h?) 


Exercise 11.11c 


I 


ὋΣ OO αὶ OO Ν 


9 


10 


(1) I2kgm*s7!; 36 J 

(1) 0.48kgm?s7!; 0.96 J 

(11) 0.24 Κρ m*s7'; 0.48 J 

(Ὁ) 96 kg m?*s7!; 288 J 

(v) 168kgm?s7!; - 877 J 

(vi) 7.15 x 10°%kgm?s~'; 2.60 x 1079 J 


(1) 2F/ma; (11) maw/2F; (1) maw?/4F 
9.82 N, 1.88, 58.2 N 
62.8 rads 7, 126 rads‘, 78.5N 


3.65 kg m? 

(1) mga, ./3g/2a, 3g/4a; (11) 2 mga, ./3g/a, 0 

38.7 rad ς΄" 

ae 3g. shi δὲ 320 
(1) ¢mglsin@; (2) 6= τοι si 6; (i) 6= era 
(w) 2n./2l/3¢ 

Mgx , δὰ 2Mg 

(1) 2 ia ae |) oe 


2n./5a/3g 


Exercise 11.12 


I 


2 


3 


(i) ΜΙ, ΤΟ (i) ΜΙ, (ἠδ) ΜΙ, 5 (iv) τ! ἡ TT? 
(vi) T72 (vii) 1 (vii) ML? (ix) ΜΤ (x) Μ- Το 
(xi) MT~2 (xii) ΜΤ΄ 2 

(i) Τα --- (ii) h oc = (iii) vce ff iw) foe J 


(v) vo x/gd 
T = kv*pA; Μ᾿ 'LT? 
Miscellaneous problems 11 


3(a° = 217 
7 Distance cu ee 5 from common base 
cos? B + cos? y — 2 cos ἃ cos β cos 7} 
9 sin? a 
᾿ ἂν rn 
— = -¢ 
I a v+eg 
Ι = "ἢ 
r=-(t+e*—l)g+—-(l—e “)u 
C C 
1: A=18%+3; B= -2- 37 
Vsina Ὁ 5" gt 
12 τ k +h)a-em—-# 
13 Ap(l — k)u?; 4.5ms 
15 3h35 min, 6.19 x 10° Ns, 1.03 x 10°N 
16 αι 
& 
l A 
1 Peat + If jaz alk 
Ps ᾿. ἜΤ ] 
--- sin 
(1 + 2A/mg) 
2 
18 G moves with constant velocity —; = 
3m 3m 
19 P = 4Av(v? + 2gh) x 10° * 
. Ave 
ue 107’ 
dV 
20 Μ "017: ae = —)ApgV 
Exercise 12.1a
1 1.5, 0.75 2 (i) 2.92 (ii) 5.83 (iii) 1.46
3 4.5, 8.25 4 5.96, 1.52 
5 417 6 1.94, 1.44 
7 1.73, 3.13 8 5.58, 0.60 
9 3.61, 6.69 10 £2.78
11 29.75, 230; 0.5
12 p(T = 30) = p, p(T = 45) = 1 − p; 45 − 15p, 225p(1 − p)
p(T = 30) = 9p, p(T = 35) = 9, p(T = 45) = ( — 3); 


40 — "3p 


Exercise 12.1b


t 
epee ν΄ 2 (i) 2,1 (ii) 18,3 (ai) 50,5 
48 8. 3,40 
36. --- Bae SNe 
5 0.59; 7,6.3; 8.26 6 4.49, 2.47 


7 (1) 0.96,0.88 (11) 1.58 
9 8,7.92; approximation np; 1.01% 
ro (1) & (ἡ) i, 22 40 


27> 329 9 
1 2 1 
11 (Git gf Ὁ 2"; --οἰς, 35 
2 
12 ------; 12,60 
(6 -- 5t)?’ ' 


13 (i) (4t* + Ξε 25.0.10 (di) 16.67, 11.78 


Exercise 12.2a 


ty, 
ὦ 
-ς,, 
wo 
Reo 
on 
~ 
~ 
- ΄ 
0] 


0 x <0 
2 

τε 0<x<3 
D(x) = Pe 

2x —-——-3 3<x<4 

] 4<x 
“6 

4 k=3; (1) +e (ἢ) στ; Φ(x) = 0 for x < −1, ¼(2 + 3x − x³) for −1 < x < 1, 1 for 1 < x
0 x <0 
5 a= 5; D(x) =44(1 — cosax) O<x <1; ()2 (ἢ) 2 


Ι 


Ξ στ (1) 0.135 (1) 0.145 (az) 0.018 


6 φί(ὴ 


Exercise 12.2b 


a ἜΝ. 

τὶ 4 0,5 
[12 

9 04 π 
6 φ(x) = 1/(b − a) for a < x < b, and zero elsewhere;


Ι 
10 φί(ὴ =—e *; 0.135; 5000 
μ 


a+b (6—a) 
19 


Ϊ 
ΙΙ hae 1.53a; (1) 0.53 (av) 1; A= 1.28 
a 


12 (1) ἀ --.Ὄ (i) Σ (ἢ) 1 
12 A,A 


Exercise 12.3a 


1 (i) 10 (ii) 92 (iii) 33 (iv) 23
2 (i) 6.7% (ii) 10.6% (iii) 0.6%
3 (i) 0.319 (ii) 0.145 (iii) 0.030 (iv) 0.537
4 0.753 5 (i) 6 (ii) 210 (iii) 34
6 66.77, 2.28; (i) 7.9% (ii) 66.2 cm
7 3.63, 1.71; 2 (since x takes only integer values)
8 (i) 0.675 (ii) 1.65
9 (i) 91% (ii) 37% (iii) 530.8
10 63¢
11 (i) 54.6 min (ii) 44.7 and 49.3 min (iii) 39.4 and 44.7 min
12 (i) 5.1% (ii) 103.26 mm (iii) less than 1% exceed 107 mm
13 1.004 cm, 0.020 cm
14 14.6, 5.2 months
15 74.35, 3.23
16 7.52 a.m.
17 (i) 8.48 a.m. (ii) 5 min, 118, 4 entrances
18 7.9%, 15.5%; 24.45 mm, 6.1%


Exercise 12.3b 

r (1) 0.018 (1) 0.729 (22) 0.067 

effectively zero (11) 0.999 (1) 0.89 

0.052 (2) 0.198 4 (2) 0.131 
(1) 0.018 (1) effectively zero 

0.827 (1) 0.062 (az) 0.021 
(22) 


“VIA Oo GO N 
POS PETES, BOE GN. TSS 
o>. 

Θ 
NO 
NO 
NO 


(ii) 0.063 
8 (i) 7.8% (ii) 95.8% (iii) 0.8%
9 n=610; 0.09 


Exercise 12.3c 


2 (i) 0.5,0.1 (ii) 0.5, 0.05 


Ji0 


4 2.5,0.112; 0.037 


3 1,1; 1, 


Exercise 12.4 


228, 211, 98, 30, 7, 1, 0 (to the nearest integers) 

Theoretical frequencies: 3, 9, 16, 17, 14, 9, 5, 2.5, 1, 0.4, 0—close 
correspondence with observed frequencies, suggesting lucky and 
unlucky tenants 

9 2, 1.97; so μ ≈ σ², as in case of Poisson distribution


10 650 
11 1.05; 0.350, 0.367, 0.193, 0.0675, 0.0177; 105; 27
14 (i) 0.9975 (ii) 0.849 
| 
1+ é€ 
15 20, 60, 90, 90, 67, 40, 33; (i) £157.50 (ii) £167.50 (iii) £160.75
16 (i) 0.406 (ii) 0.184 (iii) 0.37


1 (i) 0.368 (ii) 0.368 (iii) 0.080
2 (i) 0.330 (ii) 0.865

3 (i) 0.018 (ii) 0.195 (iii) 0.156 (iv) 0.0003
4 (i) 0.165 (ii) 0.269 (iii) 0.373

5 (i) 0.003 (ii) 0.037

6 52 

7 

8 


14 ἰ --ε 74; 


3 
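The probabilities quoted in answer 11 of Exercise 12.4 (0.350, 0.367, 0.193, 0.0675, 0.0177) agree with the first five terms of a Poisson distribution of mean 1.05. The following is only a sketch of that check: the question itself is not reproduced here, so the Poisson reading is an assumption, and the helper name `poisson_pmf` is illustrative.

```python
import math


def poisson_pmf(k, mean):
    """Return P(X = k) for a Poisson variable with the given mean."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


# Assumption: answer 11 refers to a Poisson distribution with mean 1.05.
mean = 1.05
probabilities = [poisson_pmf(k, mean) for k in range(5)]

for k, p in enumerate(probabilities):
    # Print each term rounded to four decimal places for comparison.
    print(k, round(p, 4))
```

Multiplying the first term by a sample size of 300 gives roughly 105, which matches the next figure in the answer, though the sample size is likewise an assumption.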


Exercise 12.5a 
2 (i) aE[x] + bE[y], a²V[x] + b²V[y]
(ii) aE[x] + bE[y], a²V[x] + 2ab Cov[x, y] + b²V[y]
3 (i) aE[x] + bE[y] + cE[z] (ii) a²V[x] + b²V[y] + c²V[z]
4 10.5, 8.75 5 10.5, 14.58 6 0, 2.42


7 Bob has the advantage; Alan’s variance = 11.67, 
Bob’s variance = 11.25 


12.25 

(i) 62 (ii) 134 (iii) 262 (iv) 334
10 60 minutes, 3.16 minutes; (i) 0.0057 (ii) 0.057 (iii) 0.565
11 (i) 0.168 (ii) 0.027
12 Σλᵢμᵢ, √(Σλᵢ²σᵢ²); 150 s, 8.124 s; (i) 0.7% (ii) 11%


13 0.638


14 


V[z] = [A,a, + Ab)? + (Aya, + λοὐ.)1σ3 
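The formulae in answer 2 of Exercise 12.5a, E[ax + by] = aE[x] + bE[y] and V[ax + by] = a²V[x] + 2ab Cov[x, y] + b²V[y], can be checked exactly on a small example. The sketch below uses a hypothetical joint distribution (x is one fair die, y is x plus a second fair die) and arbitrary constants a and b; none of these come from the exercise.

```python
from fractions import Fraction
from itertools import product


def expectation(joint, f):
    """Expected value of f(x, y) over a joint distribution {(x, y): prob}."""
    return sum(p * f(x, y) for (x, y), p in joint.items())


def variance(joint, f):
    """Variance of f(x, y) over the same joint distribution."""
    m = expectation(joint, f)
    return expectation(joint, lambda x, y: (f(x, y) - m) ** 2)


# Hypothetical dependent pair: x is a fair die, y is x plus a second fair die.
joint = {}
for d1, d2 in product(range(1, 7), repeat=2):
    key = (d1, d1 + d2)
    joint[key] = joint.get(key, 0) + Fraction(1, 36)

a, b = 2, -3  # arbitrary illustrative constants

ex = expectation(joint, lambda x, y: x)
ey = expectation(joint, lambda x, y: y)
vx = variance(joint, lambda x, y: x)
vy = variance(joint, lambda x, y: y)
cov = expectation(joint, lambda x, y: (x - ex) * (y - ey))

# Left-hand sides computed directly from the distribution of ax + by.
lhs_mean = expectation(joint, lambda x, y: a * x + b * y)
lhs_var = variance(joint, lambda x, y: a * x + b * y)

# Right-hand sides from the quoted formulae.
rhs_mean = a * ex + b * ey
rhs_var = a * a * vx + 2 * a * b * cov + b * b * vy

print(lhs_mean == rhs_mean, lhs_var == rhs_var)
```

Exact fractions are used so that the two sides can be compared with `==` rather than a tolerance. When x and y are independent the covariance term vanishes and the second formula reduces to the first part of the answer.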


Exercise 12.5b 


I 


“I MQ Oo CO 


(1) 06.057 (a) 0.57 2 (1) 0.072 (uw) 0.165 
0.048 4 (2) 0.115 (a) 0.008 
(1) effectively zero (22) 0.998 7 

0.001, suspicions justified 

136 8 96 


Exercise 12.5c


58.5, 31.7 2 5.08, 0.55 
(1) 1.79 (1) 1.64 (11) 1.61 
(72 ) Di = | 
1 
σ' πη --ἰ 
VEX] = —; Efs?] = σ΄: = ---- --------- 
[1] Ξ- --; Ε[ΡἼ - σ; 4=———, 


Exercise 12.6a 


10 


II 
12 
13 


14 
15 


Significant at the 1% level 

Significant at the 5% level 

(i) Significant (ii) Not significant (iii) Significant

(iv) Not significant evidence against the hypothesis

Significant at 5% level that techniques are effective 

Significant at 1% level that support has decreased 

(i) Not a significant result

(ii) Significant at 1% level that claim is not justified

(i) Significant (ii) Not significant evidence against the hypothesis
Significant at 5% level that cards were not drawn at random
Consistent with hypothesis; significant at 0.1% level that hypothesis is
incorrect

(i) 0.0004 (ii) 0.043; not significant at 5% level that germination
rate is less than 75%

Significant at 1% level 

Highly significant, almost conclusive evidence 

(1 − x)ⁿ; (i) 0.107 (ii) 0.028 (iii) 0.001
Significant at 0.1% level that hypothesis is incorrect
0.109; (i) 527 (ii) 537

475 
Exercise 12.6b

(i) 58.8, 67.2 (ii) 58.0, 68.0 (iii) 56.5, 69.5
(i) 1 150.1, 1 153.9 (ii) 1 149.6, 1 154.4

(i) 154.6, 163.6 (ii) 153.2, 165.0
(i) 7.35, 7.49 (ii) 7.32, 7.52

(i) 54.1, 60.7 (ii) 53.05, 61.75

Highly significant that there is bias; 0.2 to 0.6 
1.996 5, 0.0645; 0.0065; 1.98 to 2.01 

No; yes 

eK; τὸ 0.776: χὰ 0.163 


Oo On HO hm OC Ν με 


Exercise 12.7a 
2 y = 0.58x + 8.4
3 W = 0.36T + 40.4; W = 58
4 y = 1.4x − 26.3
5 (i) y = 0.28x + 69.8 (ii) y = 3.6x − 36.4
6 y = 33.6x + 8.1
7 y = 1.24x − 24.4
8 F=0.91 — 6.3; 9
9 y = −1.28x + 20; 16.2
10 (i) y = 0.8x + 1.5 (ii) y = 1.9x − 3.1
11 19.14, 15.03; y = 0.33x + 8.8
12 log₁₀ Y = 0.021X − 37.2
13 (i) 15.2, 1.72 (ii) 149 (iii) 2.3 (iv) a = 1.5, b = 0.015


Exercise 12.7b 
2 0.88 just significant at the 0.1% level
3 0.99 highly significant
4 0.92 highly significant
5 0.28 not significant
6 0.72 just significant at the 1% level
7 0.23 no significant correlation
8 0.35
9 correlation coefficient of −0.84 significant at the 1% level
10 0.65 11 0.43 12 0.65
13 (i) 0.61 significant at 5% level (ii) 0.45 not significant
(iii) 0.93 significant at 0.1% level
Exercise 12.7c


wo won AF | ND κ᾿ 


ie 
° 


oe! 
με 


ρ = 0.66, τ = 0.47 not significant

ρ = −0.08, τ = −0.02 tastes quite dissimilar

(i) 0.79 (ii) 0.61; both significant at the 1% level
0.90 5 0.22

ρ = 0.79 significant at the 1% level

(i) ρ = 0.92 (ii) τ = 0.78

(i) τ = −0.71 (ii) not quite significant

ACBDE, ABDCE, ABCED; 120; 3%

ρ = 0.74, τ = 0.62 significant at the 1% level

τ = 0.67 significant at the 1% level


Miscellaneous problems 12 


I 


Oo ὦ .ὕ. 


10 
Il 


I2 


13 


14 


15 


9\ /1\* (5 ᾿ Pea VN gy cay 

(3) (3) (5) Goi) 

4(4t)?e* 5. λ(λὴ "4 ™ 

9!’ 8᾽ (Kk — 1)! 

(I-A +8 9... 1-4 4 2ὴ 
ἜΝ Ε “8 pao on 
1 (ἢ) 32 min 

68 (ti) 404 (ii) 471 
(naar ce 


r! 


36, 180 
50m + 16. os 
(ἢ) 3 (() 27 
(1) —np(1 -- Ἢ (it) A= 1.96/n (iit) 0.077 
(i) p(A₁) + p(A₂) + p(A₃) − p(A₁A₂) − p(A₂A₃)
− p(A₃A₁) + p(A₁A₂A₃)
Σ p(Aᵢ) − ΣΣ p(AᵢAⱼ) + ··· + (−1)ⁿ⁻¹ p(A₁A₂...Aₙ)
I We 1 
(2) 1 - — ΟΝ -+ (—1)" Ἢ which tends to 6 


—(2N-r-1\ 1) (2N—2\2N-1 
ae ἌΓΟΝ DONE AN. ἢ 
Exercise 13.1a


121 46 33 103 15 205 17 
66 41 12 0 0 0 0 
"\o59 51 4] 2 f 50 4 88 4 
108 45 12 0 0 0 0 
ll 4 37 Ὁ 

1041 22 19 26 265 τ 

3 [106] 22 21 18 010) 5 ( = a 


987 18 35 18 090 


Exercise 13.1b 


24 4 1050 3 20 = 51.0 
r {32 0 1507 5 32 47.1 
28 9 967 5 19 50.9 
2 b = 3a; d = 5c; f = 2e; g = b + d + f = 3a + 5c + 2e, etc.
h is independent of the others, including g 
3 b=3a; c= -,24: 3a+ 3d—f=0; a—2d—e -- 0, etc. 
4 col. 2 = −3 col. 3; replace 5 by −4


5 row 1 − row 2 − row 3 = 0; 2 col. 1 + col. 2 − 7 col. 3 = 0;
col. 2 − 7 col. 3 − col. 4 = 0, etc.


x = −30; yes: 8 col. 1 + 6 col. 2 − 47 col. 3 = 0
H₂O + C₁₂H₂₂O₁₁ contains same as 6C₂H₄O₂


columns 1, 2, 5 are 
linearly dependent 


Exercise 13.2a 


L l 3 ΠῚ 
ἜΝ: 2 2 ΕἾ 
oe ὁ 1.1 ὧδ 1.001. Ὁ} 
a 2 2 2 ee 
1 3 1 4 a 
29 5 "ΟΣ RB yp 
J3 1 f 111 17 
a 2 ν2 «2 \ V2 V2 
ahh ΟἹ 20 Ο. (01 0-1\ /-06 08
a) ( δ᾿ ( 0 ) ( ): (| ᾿ ( 0.8 τὶ 
3 0\ (2 0\ fi 0 1 0 
t ἢ : ἣ . 2 4. ἢ 


2 «x and y stretches; enlargement and half-turn; enlargement x5 and 
rotation; x-shear; y-shear; reflection in y = —x and enlargement; 


--Ἰ 1:52 0 
e : - : ae ἣ (x-shear and-enlargement) ; reflection in 


y = x tan 4a; reflection in line y = x tan (45° + 3a); rotation +40°; 
all points map into line y = ἔχ; (x, y) +> (x, x) lying ony = x 
3 Reflections in y = 0, x = 0, y = +.; rotation through nz, (n € Z) 


2 BV F203) 0. f a 4 20 
1 ( ἢ: ο 5 ~3 a 
3 1\/2 -1 0\ (7 -1 3) 
( (; 2 y= —2 > 
( 4 ες -Ἱ 3) =( 2 —16 ~"s) 
—2 3)/\1 2.3 \-1 8 Of 
all lying on line 2y + x = 0 
8 ( ἡ Δ. ‘= ( a ae ) (parallelogram) 
Lee ἃ ἢ ἡ -i 120 
-2 2 3 —10 
(τς τ ὦ 


τὸ. WZ 7 —6)\ vectors in direction (1, 1) unchanged ; 
—18 19 vectors in direction (1, -- 3) x 25 
1 1 
1 ; ᾿ shear in line y = x 
τ 12 


Exercise 13.2b 


1 0O 0 0 1 0 cos2a sin2a 0 
r (10 - 0] (aw) {1 0 0} (az) | sin2a—cos2a 0 
0 0 1 00 1 0 0 ] 
] ~—2 -—2 1 0 0 0 1 0 
(wv) ¢{-2 1 ἢ (v) [ —1 O} (0,111 0 0 
“-2 --ὃὸὁ 1 0 0 -ἰ 0 0-1 
—-l1 2 2 0O+1 0 
(vii) ὦ 2-1 2] (vz) {+1 0 0 
2 2-1 0 0 1 
0 —,/2 —,/2 cos 0 sin@ 
(1x) | τ} 1 “--]Ἰ δπά inverse (x) 0 1 0 
5 0 0 1 0 0
(xi) 10 5 0] (xi) 10 0 0 
005 001 


2 (i) Reflection in y = 0 (ii) orthogonal projection on to z = 0
(iii) 120° rotation about x = y = z (iv) quarter turn about x-axis
(v) x and y stretches, z reflection
(vi) enlargement ×3 and central inversion
(viz) rotation through tan” ' 2 about z-axis 
(υ111) rotation through tan” ' ξ about x-axis 
3 —1,0,1,1, —6, —27,1,1 
5 All points have images in the plane x — 2y -- z= 0 
O-1 O\ fl -1 1 1 =21-1 -Δ -!1 
61} ὁ Of t1 1-1 1 --ὶ 1-1 -- 
Oo ΞΡ VE 1. 1. 15] ῷ. τῷ} b=] 
al-sl fal. teal F&F d 
= leak Δ Ἵ {3 eet ed 
—-l1-1-1 1 1 21-1 1 
8 Glide reflection in line y = x; + + translation; + — and — + glide 
reflection; — — half-turn 


Exercise 13.3a 


6 1 4 πὸ 
I ap -(° A za = (1 ᾿ CD={-2 0 4 
8 -12 4 
iat ἢ 1 0 2 9 3 9 
pe= (ΠῚ 4 ee ἈΦ ᾿ a -ἰ 1 - ᾿ 
7 8 3 
CA=[ 8 2]; CB=]2 DB, DA, AC, BC impossible 
12. 8 8 


8. 1 


2 1 3-1 4 
BR ter ok oD) 5 —7 —4 —1 3}=(0 0 0) (or transposed) 


D 


IO 


II 


19 


14 
16 


17 


19 
20 
a ka P
a ( Ὶ where δὲ = 1 — a? 
--- —4@ c—a 
k 
I, A, A? = ΒΞ. A°, AB, BA, A°B, B°A; twelve in second group 
Only scalar matrices, with rows (k 0 0), (0 k 0), (0 0 k), commute with all square matrices


Multiplication of complex numbers: (a₁ + ia₂)(b₁ + ib₂)


1 0 0 
0 1 ἃ]: bane es 


—27 — 11 —14+ 10 
ae ὦ 27 τ + 10/2 
—14—102 21-—1h 
—21 — lle —14 + 102 
13 — —] 
δα. ᾿ ἂν i » etc. (Fibonacci) 
—21 34 —144 233 


Ρ 4 es eee ee ee 
(' theses = Fs = (p 2) (a 4) (ὃ #0) 


Ι --ὶ —| 0 
C= Oo: = 

(ro) = (0-1) 
a = 1, b = 1, c = −1, d = 4, e = 0, f = 3, g = 2, h = −1, k = −1
cos θ = Σx₁x₂/√(Σx₁² Σx₂²)


1 0 g\ [x 2. 9 x 
(x y [01 fifty}, αν 2 0 4 Ie} ly 
g f ε \I -3 15 0 \z 


Clockwise rotation of quadrilateral about O through ἐπ 
Five points in three dimensions rotated through 120° about line
x = y = z


Exercise 13.3b 


I 


0 - , 01 —2 0 
AB = = BC = CB = 
5-( 1) κατῷ ee 
0 -2 4 0 1 2 
= €C = oe D2 = * 
sen ea-(2~2) αν ἢ στ ἢ 


A, quarter-turn; B, reflection; C, enlargement; D, shear 


2 P, 120° rotation about x = y = z; Q, half-turn about z-axis; R, reflection
in plane x = z; S, enlargement ×2


3 R,R, = R,R,, formulae for cos (α + β), sin (α + β)




10 


II 
3 —|
(2) ( 5) 0 enlargement and half-turn 


0 -- 0 --Ἰ 
(1) ( )( ; ᾿ enlargement and reflection 


i . 1 

Ν᾽ COS 4π = Sin 47 

(4) 2] ἢ : enlargement and reflection 
sin 37 —COS 3% 


cos &@ —sin ἃ 
(iw) 13( . ,tana= + 
sin& cos¢@ 


0\/1 —15 
: ) ( εἶ shear and enlargement 


lo allo ἢ 


0 --Ἰ 
( ) χ and y-stretches and quarter-turn 


(vi? ) > ἢ = * i) (; ᾿ ᾿ | shears and y-stretch 
2 0 0\ /-1 O O\ /O 0 1 
(1) ( 2 0 O-1 OF 10 1 | 
0 0 2 0 0--| \l 0 0 
1 0 0\ /-1 060 2 0 0\ /O 1 O 
(12 ) ( 1 0 0 0 1} (10. 2 ΟΠ] O 0 
0 0 3 0 1 0 0 0 2/ \0 0 1 
1 0 0O\ /O 1 O 
(0) [0 1 01 O O 
0 0 0 \0 0 1 


Suspect 120° rotation about line 2x = y = 2z 
Vectors (0 0 z) 


e.g. permutation matrices, rotations, etc. 
0 
e.g. from no. 7, M = Ν᾿, X = {0}. 
] 
Must be an orthogonal matrix, with unit determinant (see Ex. 13.4a,
no. 13)


0 
Composition of a permutation matrix with one of [ 0 +1 
0 


13 Reflection, rotation and translation +k parallel to y-axis 


Exercise 13.4a 


2 


4 


Eft Robs (2. 0 0 --2 
1 wy\r -1 1-1) \ob -ὰἜὁδὰ --ῶς 


ad # be 


2) 


<<) 


Φ 


Ι 


II 


12 


15 


17 
4 --ΞῬἸ -2 -2 35 1 —55
[ 2 121]: } 11 —1 —6}; 30 
—-8 1 4 4-5 - 105 
2-4 2 
| "ὦ ἢ = inverse X6; (x y z) = (—1 --ὀ-ἰ 12 
-ά -4 § 
1 —a ac— ὦ 
O 1 -—-e 
0 0 1 
A C B 
ifs 4 ef 
C BoA 
a 


1-1 1 a 2 
4 1 1-14; | d)= 41 
—!] 1 1 C 3 
M² = I, see Ex. 13.3a, no. 6; e.g. half-turn, reflection, etc.
—-25 4 3 
Μ᾽ = 96 —4 —3](M’)~' = (Μ΄ 
—33 5 4 
cosa cosB -—cosa sin B sin & 
sin a cos B sina sinf —cosa@] matrix orthogonal; rotation 
—sin B cos β 0 
29 31 13 1-2 1 2s “5. ἢ 
4{ 1-1 —1]}; [O 1-1}; ἐπ -8 2); 
9 9 3 0 O 11 14 19 —] 
1 -2 -—1 1 1 1 
$4 ]-1 5 43]; E22 
3 ~12 —6 Ι 2 38 
l+a —a —a 
(1 — a)" '(1 + 2a)7? πα l+a —a}]; 
—a —a l+a 
c—1 a-—be ὃ -- a 
(a2 + ὁ + εἷ — 1 — Qabe)" 1} -- οὐ δ᾽ —1 c—ab 
b—ac c—ab a? —-1 


1 oo ὦ 
fe oe ae 1 Ort 9 Ἄν 
2\ 1 .» τι. ; [ow 1 @ } issingular 
Exercise 13.4b


2 t,2t¢+ 1 3 —2and +5 
4 Non-trivial solution when a,b, — a,b, # 0 


Exercise 13.4c
3 Using row 1 on p. 42] 
(2) DJTQOAROLR.SPWEOY,EA,KGNJIWFBBFG 
(1) COATISX,GZLDEAZSOWN.RMEJWDAKCDNVTP 
WTMFAUO.RXF 


5 7 
4 (; 7 E.GHEE OKF A 


Exercise 13.5a 

r (7) -ὖ (wa) 54 (1) O (tv) 2670 (v) 2232 (wu) O (vi) 0 

a 

αὐ 5" 

3 (1) 4ab (uw) 1 (tt) cos20 (w) 1 (v) —1 — αὶ 
(ut) (ps — 19) (x — y)z 


a+ qb ra + sb 
οὐκ. ΞΔ — 
ay 


ΤΟΥ͂, 
ἝΝ, d q 
6 A= -1,7 7 x=0,-1 
x 2 (ar —cp)? = (br — cp) (aq — bp) 


2 has factor a — ᾧ 


Ι! 

| 
NO 
iN) 
ων: 

i 


Exercise 13.5b 
(i) −1 (ii) −3 (iii) −21 (iv) −19320 (v) 0
Δ - αὐ - δ +? — 3abc,A, A, —A; a? + 63; 1 — 2x3 + x 


2 
4 See alien cofactors, 13.5, p. 000 5 —2,8,9 
6 Any pair of rows or columns are linearly dependent 
7 (i) 2abc (ii) 0 (iii) 0
8 () 1 - αἢ - δὲ +c? + Qabe=0 (ii) be +cat+ab+1=0 
9 33 
P@qnrs 0 
Opgqnreies 
ir ja b ¢c 0 O;=0 
Oab«e 0 
0 0 « ὁ ς 
Exercise 13.5c


I 


(i) 276 (ii) −ab(a − b)(1 − a)(1 − b)

(111) (a+ b+ c)? 

(iv) −(a + b + c)(a + bω + cω²)(a + bω² + cω)
(a + b + c)(a + bω + cω²)(a + bω² + cω)
= (a + b + c)(Σa² − Σbc)


(a—l1)(a +a+1); (1+a+4a’)*(1— a)’; 0 
(1) (ὁ -- α)(α -- ο)(ε -- δὴ) (uw) (a + b+ οὐ (ὦ -- α)(α -- ο) (ὁ — δ) 
Σέ wasup eae 
(v) 0 
—] 6 (2a + b)(a— δ); —(2a+ b)(a—b)*; 0 

0 « 6: 
(i) lc 0 al - 4a7b2c?; (uz) (ὦ -- αὖ (um) --8 

ba 0 
CC eee ree er ee ee ee Ce eee 
(v) 0 

1 2 3 4 

2 3 4 ἢ 
“S13 4 5 6 

} 1 1 1 


Exercise 13.5d 


I 


9 


F α +c¢*% ab +e] ὦ 
esa haa xan 
1 0 Ι | 2 | 
ea | +h |=} ; 
a+b a δ΄ αΖ - εξ. αὐ ας 
αὐ bc +ac be = ab b? + εἶ bc 
ab ab αο- ὁ" ac δὲ α2 - ὁ" 
a p Ol? 
δ g 0] =0 
c r 0 
atx b+yc+2|=|a ὃ οἱ + la b 2] -- |x bc] + |x ὃ 2 τ 


+ |a y 2] + |x » οἱ + |x y 2| 
Must have value of determinant = 1 


Exercise 13.6a 


. 
(i) 33,3: (ὦ) = -Ξ -ς = 5; no solutions 
| a 
(ἐπ) --- 3 


int mp mn—lp [2 + πιξ 




10 
(i) 3, −5, −1 (see 13.6, Example 1)
(ii) infinity of solutions: 3 × eqn. 1 − 2 × eqn. 2 = eqn. 3 (see 13.7,
pp. 445-6)
(br − cq)² + (ar − pc)² = (aq − bp)²
a + b + c = 0 ⇒ x = y = z; or a = b = c ⇒ x + y + z = 0
λ = 3, −1, −1; λ = 3, x:y:z = 1:0:1;
λ = −1, x:y:z = 3:4:−9
3x = a + b + c; 3y = a + bω² + cω; 3z = a + bω + cω²
B = -1 (A, -—3/,A + 3) 
k=1,nosolutions; k = — 2, (A, ὅλ + ὃ, — ἃ — 2) 


Exercise 13.6b 


3 


4 


15, 10 
2 a = 2 k= —-5,c¢= -3 


(i) λ = 0, x:y:z = 1:0:1
(ii) values other than 0 or 1:
21 — 3 l —j? +4, -- 3 
Saga a ee ee ee ee. a τας 
2A(A — 1) 2(A — 1) 2A(A — 1) 
(−3, 6, 2), (2, 1, 6); (sin α sin β, −cos α sin β, cos α cos β)


Exercise 13.7a 


5 
6 


(i) No solutions (ii) −44, 19, 75 (iii) −1, 1, 1 (iv) 4, 0, −4


—106 111 109 
) “65? 260° 36 
—1,-1,1 4 xiy:ziw = 1:-—2:0:1 
1 2 0 5 0 0 
eg. M,={0 1 0] M,={0 1 —3] det A = 46 
00 1 00 1 


x = 5.23,y = —6.91,z = —2.33 
x = 0.898, y = 0.763, z = 0.618 


Exercise 13.7b 


—14 4-5). 1 d -- 
@) La δὴ 9, i(; “3 ee κί. ἢ 


2 4 1 1 5 3 
(w) singular (0) | 11 —1 —6] (vi) τς] 4 -—4 0 


11 2-5 72 —32 44 
(vii) 24({-4 0 4) (vii) χὲσί 132 -37 -71 
~1 2-1 -60ὺ 70 50 
2 (i) No solutions (ii) −44, 19, 75 (iii) −1, 1, 1
. 9 


6 
6 0 -2 

4 1[{2 -1 1]; —2, 34, 8ὲ 
0-1 3 

6 K=2; x =6 — 5z,y = 3z -- 2; 
X= 2+ 2z,Y 


4 (t) δὲ, —12t,¢ (aw) nosolutions (111) δὲ + 5, —12¢ — 10,¢ 
(2 


Ὁ) no solutions 


8 All points map into the plane 2x ἐν —z=0 


a, ὦ, ε, 2a, — 4d. + ας = 0 0 -2 --| 
9 | a, ὃ. ως, where 20, —46,+6,=0, eg. Ὄ|ιΛἈ||0 0 
a, ὧς C5 26) — 40. + ¢, = 0 4 4 2 
Sa 5b 506 
10 | 3a 35 3c 
2a 2b 2 


11 All 2 × 2 minors zero; rows and columns linearly dependent in pairs


Exercise 13.8 


1 «38 τοῦ 10 
l—el+eQ0 111 —-1 1 0 
no, letM, = 4} l1+e1—e0])]= $]]1 1 0]}t+e 1—-1 0 
0 0 2 00 2 0 0 0 


= 3(A + eB), and similarly M, = 3(C + eD). Work out successive 
3 


products...M,M,M,M,M,, neglecting terms in 6“, e’,... 


Miscellaneous problems 13 
I (x, y, Z) 5’ Ἢ (x, —Z,y); (x,y, Z) Be, (z—l,y, —x +1); 


(x,y, 2) EL, ( τοι, τῷ, τα +1); 
x=y,x=z-—Il1,y=0,z=1. 
Reflections in three planes would have a fixed point 


Closure re =s* =e: Do, asonp.502 
3 3 2» 


86y = 41x + 176 

(3t, 3 — 4t, 2 — 5t) through points (0, 3, 2) and (3, —1, —3) 

Let l = (l m n), x = (x y z), a = (a b c). Equations state that a = l × x,
so we must have a, l perpendicular, i.e. al + bm + cn = 0; also a, x
perpendicular, so ax + by + cz = 0. A vector parallel to a × l satisfies
lx + my + nz = 0, i.e. (bn − cm, cl − an, am − bl)(l² + m² + n²)⁻¹


“uO μι ν 
8 Least 6 (3 zeros in same row or column); greatest 11 (no two zeros in
same row or column)


g (abc)~* (ὁ -- ο) (ε -- α) (α -- b) Nath se '(r4+s+tan— 1) 


10 No 11 cos nθ
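The vector argument earlier in these answers (the equations state a = l × x, with a perpendicular to l, and one solution is the vector a × l divided by l² + m² + n²) can be checked numerically. The sketch below assumes that reading of the answer; the particular vectors chosen and the helper names `cross` and `dot` are illustrative only.

```python
from fractions import Fraction


def cross(u, v):
    """Vector product u × v of two 3-component tuples."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def dot(u, v):
    """Scalar product u . v."""
    return sum(p * q for p, q in zip(u, v))


# Illustrative vectors, chosen so that a is perpendicular to l.
l = (1, 2, 2)
a = (2, -1, 0)
assert dot(a, l) == 0  # the consistency condition al + bm + cn = 0

# Candidate solution: (a × l) / (l² + m² + n²), the one perpendicular to l.
norm_sq = dot(l, l)
x = tuple(Fraction(c, norm_sq) for c in cross(a, l))

print(x)
print(cross(l, x) == a)  # l × x reproduces a
print(dot(l, x) == 0)    # x satisfies lx + my + nz = 0
```

Any multiple of l may be added to x without changing l × x, which is why the extra condition lx + my + nz = 0 is needed to pick out a single vector.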
Exercise 14.2a 
1 No 2 (1) Excludedb + d=0 
3 123465, 412365, 
1 4 0 -] ] 
No, e.g. = 
006. (5 )+(; 1 ᾿ 


{M: det M « 0}, x not closed 
{M: det M = 0}, x closed 


(11) exclude Q = 0 
341256, 234156 


3 . 
l 2 


5 No. 120° turns about two altitudes may combine to give a half-turn
(see 14.4, p. 512)
6 Include 11 for closure. {1, 13} 


NZZ QQ*RR*C RQ cisOx4+ iy: x,yEZ 


+ 1 21121721271 11 Ὁ Ὁ l 
- 01010101 0 0 ] 
x O 1 01 121 21 «21 «0 1 ] 
+ 00 011 1 1. 9} 10 ] 0 
(zero excluded for x and +) 
8 Multiplication closed, addition not closed 
9 7: 8 δὲ ges i g° 
] x Χ -Ξὄ.} 
(1) es l—x = x ᾿ 
(1) In x + sin x In (sin x) sin (Inx) In (In χ) sin (sin x) 
wa (—3 + Qe — 2x7) x + 3 —x + 3 
i a os a es a τ : 
(iv) 4 9—4% 7-—4% 4-3 44-5 
10 (i) Yes (ii) no (iii) no (iv) yes
ee x 
13 
2D Ι 0 0 
3D Ι 0 ] 
14 (i) Closed (ii) yes (iii) yes provided x ≠ y
Exercise 14.2b


3 (ba)c ≠ b(ac) 4 (bc)d ≠ b(cd) 7 Associative
e.g. k = −1, put x = cot α, y = cot β


Not associative; (0, 0), (a; — a), ἄ. — a,); no. Invalid because associativity
assumed. (B ∘ A) ∘ A* ≠ B ∘ (A ∘ A*)


Exercise 14.2c 
1 ℰ, ∅; symmetric difference: A Δ B = (A ∩ B′) ∪ (B ∩ A′)
(i) {2, 4, 8} × (mod 14) closed (ii) 10


(i) 27 + 13 = 0; 12 + 28 = 0 (mod 40)
(ii) 17 × 13 = 1 (mod 20) (iii) 53 × 17 = 1 (mod 100)


0. 0.8 
5 (1) 0. (Ὁ 3) rotation) (41) i ( : Ὁ (reflection 


0.8 —0.6 

0-1 0-1 0 6 8 —1 

(222 ) ( 4 (wv) 0 0 1} (Ὁ) [-10 -13 2 
—-1 0 0 7 9 --Ἱ 

τ; cos 80° sin 80° oe 
va 

—sin 80° cos 80°) “HOM 
] τ 2 ως 2Kx— 1) 

6 cS 1/3 
(1) = (11) oa (221 ) a (10) x (v) log, 
: 3x — 7 a3 ax 

(v2 ) 5 4 (v2) a 


a (12345) Gi (123456) (0 (1284567 
7 41523) “/\456132) ’\4751362 


10 Inverse requires a³ + b³ + c³ − 3abc ≠ 0, i.e. a + b + c ≠ 0 and
a, b, c not all the same
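The modular inverses quoted earlier in Exercise 14.2c (27 + 13 = 0 and 12 + 28 = 0 mod 40; 17 × 13 = 1 mod 20; 53 × 17 = 1 mod 100) can be reproduced with Python's built-in `pow`, which accepts a negative exponent with a modulus from Python 3.8 onwards. This is a sketch only; the function names are illustrative and not taken from the text.

```python
def additive_inverse(a, m):
    """Return the element b in 0..m-1 with a + b = 0 (mod m)."""
    return (-a) % m


def multiplicative_inverse(a, m):
    """Return b with a * b = 1 (mod m).

    Raises ValueError when a and m are not coprime, because then no
    inverse exists. Requires Python 3.8 or later.
    """
    return pow(a, -1, m)


print(additive_inverse(27, 40), additive_inverse(12, 40))
print(multiplicative_inverse(17, 20))
print(multiplicative_inverse(53, 100))
```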


Exercise 14.3a 

r (2) 8 (aw) 4 (1) 120 (a) n (v) 8 (uw) 4, 8,6, ὃ 
4 (i) Fails for closure, identity, inverse (ii) fails for inverses
(iii) no inverses (iv) neither inverses nor identity (v) not closed
(vi) not closed (vii) include identity (viii) include −1
(ix) not closed
5 (i) b= 'ὶ =e = (7) (ii) a(bc) ≠ (ab)c


(111) table (with v and w interchangeable): 


lil p qrs t uv w 
P\lp qt wos r 4 
giq 1 pvuyrwei s 
rjr ¢ vs 1 wgqiu p 
s|s wu lr pov qt 
tit vr wp y ls 4 
uju s wgoué itt pr 
viv τ ¢ uqs pw il 
wlwus pt qrii oo 


8 T(x) = —x + ὁ (reflection in $d) 
10 eg.G=R*,x,g=2 


Exercise 14.3b 
1 3² = 9, 3³ = 13, 3⁴ = 11, 3⁵ = 5, 3⁶ = 1 (mod 14)
3 12 4 7⁴ = 1 (mod 15); {3, 6, 9, 12} × mod 15 (identity 6).
5 {4, 8, 12, 16} × mod 20 (identity 16)
l l ] 
6 ᾿ ] -- χ 1 -- -- oa ot (D,, as on p. 509) 
χ 


l—x χπὶ χα 


7 (i) 2 (ii) 2 (iii) 4 (iv) 10 (v) 12
9 Dz, as on p. 509 
10 I, A, B, AB, BA, BAB, ABA, ABAB = BABA (D₄, as on p. 511)
11 No finite additive subgroups of C, except {0}
12 e.g. generated by (1 2 3 4 5 6 7); generated by (1 2 3 4 5) (6 7)
13 C₉; 2 of period 3, 6 of period 9;
C₁₂, 4 of period 12, 2 of period 6, 4 and 3; 1 of period 2
14 Generated by 2, 6, 11 or 7 
15 (1) Dz 
(ii) eighth symmetry is central inversion, (x, y, z) ↦ (−x, −y, −z);
group is C₂ × C₂ × C₂.
16 (1,0), (2 ἢ Remove elements having x = 0


x 
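Several of the answers to Exercise 14.3b rest on short modular calculations: the powers of 3 modulo 14, the fact that 7⁴ = 1 (mod 15), and the identities 6 and 16 in the sets {3, 6, 9, 12} (mod 15) and {4, 8, 12, 16} (mod 20). The sketch below reproduces them; the function names are illustrative, and `powers` assumes its first argument is coprime to the modulus so that some power does reach 1.

```python
def powers(g, n):
    """List g, g^2, g^3, ... modulo n, stopping at the first power equal to 1.

    Assumes g is coprime to n; otherwise no power equals 1 and the
    loop would not terminate.
    """
    result = []
    value = g % n
    while True:
        result.append(value)
        if value == 1:
            break
        value = (value * g) % n
    return result


def is_closed(elements, n):
    """True when the set is closed under multiplication modulo n."""
    return all((x * y) % n in elements for x in elements for y in elements)


def identity_of(elements, n):
    """Return the element e with e * x = x (mod n) for all x, or None."""
    for e in elements:
        if all((e * x) % n == x for x in elements):
            return e
    return None


print(powers(3, 14))
print(pow(7, 4, 15))
print(is_closed({3, 6, 9, 12}, 15), identity_of({3, 6, 9, 12}, 15))
print(is_closed({4, 8, 12, 16}, 20), identity_of({4, 8, 12, 16}, 20))
```

The length of the list returned by `powers` is the period of the element, so 3 has period 6 modulo 14.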


Exercise 14.4a 
I lelei 
—1e9e x C, 
fi, -ἢ ὦ 3, Tho fx, 3} 
4 (1) non-Abelian # Abelian (1) D,, C, 
(11) C,+2V,,+ Ἐ V5, + 
(iv) impossible to set up one-to-one correspondence between R* and Q 
4 Must be infinite 
6 x (ES) — 2πχ (Ε rotations of circle) 
7 C63 {fg} © {r, t} of period 6 
8 {1,2,4,5,7,8}; {1,3,5,7}; {1,3,7,9}; {1,5,7,11}; prime 
10 All six permutations of a, b, c are automorphisms


2 aed (r= 1,2,3,4,5,6); οἷβ ἀπ cis qn (r = 1, 3, 5, 7) 


Exercise 14.4b 
r (ἢ δ). (uw) δ, (wi) Dy (w) Cy (v) Dg (wt) OY 
2 


; π : 
4 Mirrors at --, group is δ), 
n 


4 See Ex. 14.3b, no. 10 


5 “τς: fel) =2-= "τὺ D, 
a Sp ἰδ) ΞΞῚ 
we=raseaH=rea=f a jH=ra= d; 
a τὸ γα Ξε ε; a =rn= 


9 Nine of period 2, eight of period 3, six of period 4
10 24 of each 
1. (1) C, (aw) D 


00 


(11) seeno.3 (Ὁ) D, (0) δὼ 


00 


yp) Ne 